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Measures of Stuttering Adaptation 


BRUCE QUARRINGTON 


Since its demonstration (6), the adap- 
tation phenomenon has attracted the 
interest of many investigators of stut- 
ering. ‘Adaptation’ refers to the gen- 
eral tendency of stuttering frequency 
to decrease in a positive decelerating 
manner as oral reading of a given prose 
passage is repeated. It has been shown 
that this effect is not confined to prose 
material but occurs also in repeated 
readings of a word list (4). It is also 
known that repetition of fixed material 
is not essential for frequency reduction. 
Subsequent sections of long prose pas- 
sages when read aloud show trends of 
decreasing stuttering frequency (2, 7). 
It would appear, however, that reduc- 
tions in frequency are most marked in 
repeated oral readings of prose, and this 
has formed the standard way of dem- 
onstrating the effect. 

The length of prose passage em- 
ployed seems not to affect the mean 
rate of adaptation shown by a group of 
stutterers (12). The time lapse follow- 
ing adaptation trials required to bring 
about spontaneous recovery of initial 
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stuttering level has been established 
(5, 8). The effect of differences in the 
social complexity or the anxiety-induc- 
ing character of the adaptation situa- 
tion is not clear. Different experiments 
here have yielded contradictory results. 
Dixon (1) compared adaptation meas- 
ures from reading situations varying 
mainly in their anxiety-inducing char- 
acter and concluded that adaptation 
rate was increased with increases in 
situational anxiety. This increase, the 
author notes, is largely carried by the 
increased initial frequency of stutter- 
ing in the anxiety-laden situations. Shul- 
man (/2) compared adaptation meas- 
ures collected in a standard adaptation 
experiment with those obtained from 
a situation in which the number of 
listeners was increased by one with 
each successive reading until a total of 
five persons including the experimenter 
comprised the audience for the fifth 
reading. His findings indicate that the 
complex situation depressed adaptation 
rate when compared to the standard 
situation. 

The adaptation experiment has been 
regarded also as something of a model 
of the treatment situation. It has been 
suggested that changes in the direction 
of greater adaptation may reflect the 
degree of improvement in adjustment 
to stuttering (/3). It is possible also 
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that adaptation measures may be of 
value in predicting individual response 
to treatment. Some stutterers show a 
remarkably stable level of stuttering in 
reading after reading. One would be in- 
clined to suspect that this inflexibility 
manifest in the adaptation test situation 
is likely to be characteristic over a 
longer period of time and to indicate 
a rather poor treatment prognosis. 
Marked reductions in stuttering fre- 
quency on repeated readings suggest 
that the stuttering of such an individual 
may be very responsive to increases and 
decreases in the affect-inducing aspects 
of the total situation. In such an indi- 
vidual any treatment resulting in de- 
creased emotionality might be expected 
to result in decreased stuttering (//). 
As far as the present writer is aware, 
attempts to study relations between the 
degree of adaptation manifest and 
treatment result have not yielded sig- 
nificant relationships. Indeed the litera- 
ture gives little information as to any 
relationship existing between adaptation 
and other measures of individual dif- 
ferences. It is possible that one of the 
reasons for the variable-in-a-vacuum 
status of adaptation is that methods of 
deriving a measure of adaptation are 
inadequate. The present paper is an 
exploration of this possibility with 
suggestions for the development of a 
more useful measure of adaptatjon. 
The most frequently employed for- 
mula for determining a measure of 
adaptation is one based on relative de- 
crease from one reading to a subse- 
quent reading (7). If the percentage 
of stuttering on the various readings 


is denoted by p1, peo, ps, ... , pj this 


measure of adaptation (Ar) is given as 
follows: 


Pi — Pn 
‘ar 
Since relative decrease is usually meas- 
ured from the frequency on the first 
reading, p; is given in the formula. The 
subsequent reading involved in this 
adaptation measure is often the fifth 
reading, but p, is used to indicate that 
any subsequent reading might be em- 
ployed. The subscript 7 is added to Ar 
to indicate which reading was em- 


ployed. 


Ar, = (1)? 





Various workers have pointed out 
that measures of adaptation from this 
method are negatively correlated with 
the frequency of stuttering on the 
initial reading. Shulman (12) gives a 
product moment coefficient of —.64 
which is a somewhat higher value than 
those from other studies (10). In the 
experience of the author the coefficient 


.is usually between —.40 and —.50. 


It does not appear advantageous to 
define adaptation by a measure thgt is 
so dependent on initial frequency: The 
concept of adaptation would seem of 
greater potential value if it were defined 


by a measure that is independent i i 


initial status. It is known that stuttering 
frequency in most stutterers will vary 
considerably from day to day, and 
from situation to situation. There is the 
expectation that stuttering frequency 
will rise and fall with variations in the 
emotional state of the stutterer. With 


*The notation of Johnson and Inness (7) 
has been changed here in the interest of 
clarity and consistency. 
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regard to adaptation, however, the ex- 
pectation is toward stability. If adapta- 
tion is a measure of ability and interest 
in improving speech performance, then 
it should be relatively constant over 
short periods of time regardless of the 
particular emotional significance of the 
test situation. To employ a measure as- 
sociated with initial frequency incor- 
porates a source of unreliability fn 
measures of adaptation. 

It should be pointed out immediately 
that using the absolute decrease in stut- 
tering frequency does not make adap- 
tation independent of initial frequency. 
This measure, Aa, which has been 
found by the present writer to corre- 
late with p; between .40 and .50, is 
defined as follows: 


Aa, —— Pi <= Pn- (2) 


This problem of dependence on ini- 
tial status of the usual adaptation meas- 
ures was recognized by Oxtoby (9, 
10) who proposed the following solu- 
tion. She sought to determine the most 
probable relative decrease for varying 
stuttering frequencies on the first read- 
ing. An individual’s adaptation score 
would then be determined by dividing 
his observed relative decrease by the 
expected relative decrease for individ- 
uals having the same initial frequency. 
Oxtoby also introduced the idea of 
multiplying this ratio by a constant in 
order to make interpretation somewhat 
easier. The constant employed was the 
expected relative decrease when ); 
equaled 10%, which she felt was the 
typical initial frequency of stutterers in 
her experience. 


The formula for this measure (Av) 
is given as: 


Av, = V (pi — pn) (3)? 


where V a . (3a) 


Pp 
1 


In this last equation k is a constant 
serving as an arbitrary standard. It is 
the experimentally obtained expected 
decrease p; — p» where p; = 10%. The 
denominator D, -in formula (3a) is the 


expected decrease from Ist to the mth 
reading for stutterers evidencing an 
initial percentage of p;. This, of course, 
is calculated from the regression equa- 
tions she derived for her groups. 

Oxtoby found that this measure of 
adaptation (Av;) made certain distinc- 
tions between her experimental condi- 
tions that Av; did not. The method 
employed in Av strikes one as being an 
efficient way of bringing about inde- 
pendence from p;. There is, of course, 
the difficulty that Oxtoby’s regression 
equation is appropriate only for her 
group and that new equations must be 
developed for use in other studies. This, 
however, is but a temporary difficulty 
and stable regressions would probably 
be derived with more extensive use of 
the method. 

There are certain inadequacies in- 
herent in Av that must be pointed out. 
Examination of Oxtoby’s data* indi- 


Original notation has been altered for 
purposes of presentation in this paper. 

*All references to Oxtoby’s data indicate 
the use of percentages of stuttering for 24 
subjects on five successive readings under 
her ‘control’ conditions. These are given in 


Table A of her appendices (9). 
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cates that the variance of pi; — po, 
pi— ps, Pi — Ps, Pi — ps (hence also 
po, Ps, Ps, Ps) increase with p;. This 
does not appear to be peculiar to these 
data; it has occurred in all samples of 
adaptation tests collected by the pres- 
ent writer. 

The presence of heteroscedasticity in 
these data indicates that serious limi- 
tations are imposed on the predictive 
and comparative use of the derived 
regression equations. More immedi- 
ately, it is evident that, due to the 
increasing variance noted, individuals 
with high p; values will tend to receive 
the extreme values of adaptation. If 
Oxtoby’s data are examined for this re- 
lationship, the absolute deviations from 
the expected values are found to be 
positively correlated with p; to the 
extent of .21 to .35 (Avo,..., Avs). 
One may also look on this as indicating 
that the reliability of the adaptation 
measures is varying inversely with in- 
itial frequency. The arguments invoked 


earlier in favour of a measure of adap- . 


tation independent of initial status 
apply here. One solution appears to be 
a transformation of 1, Pp, ..., Pn, that 
will render variance uniform. Several 
transformations were tried out on Ox- 
toby’s and other data, the most satis- 
factory being the angular transforma- 
tion: ¢ = 2 arcsin ¥ p where p = per- 
centage of stuttering (3, 14, ch. 17). 
Henceforth ¢1, ¢2,..., ¢» indicates the 
angular transformation of 4, po, ..., Pn- 

The measure of adaptation suggested 
here (A¢) is based on the deviation of 
each particular value of ¢, from the 


A 
predicted value ¢, derived from the 
linear regression line of ¢, on ¢1. The 
formula is then given as: 


Agn = on— Gn (4) 


With uniformity of variance there 
is no need for a ratio; a simple differ- 
ence suffices. It might also be noted 


that the correlation of ¢; with i ~ tis 
equals zero, which may be proved 
algebraically.‘ 


The specific regression equations for 
$2,.-., ¢5 derived from Oxtoby’s data 
are offered below: 


go = .938¢1 — .0728 
éy =3 S124, — 0575 
ds = 860d, — .0119 
$s = 84941 — .0033 


A 
For purposes of comparison ¢2 and 
A . . 
$3 regression equations derived from 


another source are offered. Sixty adult 
stutterers applying for treatment at the 
Speech Clinic of the Toronto Psychi- 
atric Hospital were required to read 
a prose passage (‘Arthur the Young 
Rat’) three times. This adaptation test 


‘The following proof has been offered by 


Dr. Leonard Feldt (x has been used instead 
of ¢:, y instead of gn): 
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was administered during their investi- 
gation period prior to acceptance for 
treatment except in 12 cases for whom 
treatment was in progress. The instruc- 
tions were essentially those employed 
by Oxtoby in her control conditions. 
The regression equations obtained 
were: 


$2 = 99741 — 1041 
de = 9604, — .1281 


It will be noted that both these equa- 
tions are more oblique and positively 
displaced on the dependent variable 
than the corresponding equations from 
Oxtoby’s study. The displacement dif- 
ferences indicate that Oxtoby’s subjects 
apparently showed greater decrements 
in stuttering frequency than those in 
the second study. These differences 
may be attributable largely to selection 
factors operating in the two samples. 
In Oxtoby’s study the subjects were 
actually under treatment, whereas in 
the second study applicants for treat- 
ment constituted the bulk of the 
sample. Many of these subjects were 
not considered good candidates for 
treatment and were not accepted for 
therapy. Those rejected were, as a 
group, lower in adaptation than those 
accepted, although the decision to 
accept for treatment was made without 
reference to performance on the adap- 
tation test. 

The differences in the regression 
equations are not great despite differ- 
ences in sample character (and other 
differences such as length of prose 
passage) which fosters the expectation 
that further studies will yield adequate 
normative regression equations. 

Even after a way has been suggested 


for bringing about independence of an 
adaptation measure and initial fre- 
quency, there remains the task of dem- 
onstrating some increased utility of 
this measure over other measures. Al- 
though some evidence will be offered 
later indicating advantages of Ad, the 
major burden of proof has yet to be 
taken up. At this point there appears to 
be some merit in turning to another 
problem in connection with adaptation 
measures to see what help, if any, the 
measure A¢ offers. 

The problem is which subsequent 
reading (Pn) is to be employed in de- 
termining a measure of adaptation. As 
mentioned earlier, the fifth reading is 
most frequently employed. The reason 
appears to be that this is a critical bal- 
ance point of reward in terms of ob- 
served reduction in stuttering frequen- 
cy and effort involved in administering 
the test. Beyond the fifth reading, group 
adaptation curves plotted in percent- 
ages flatten out and the test situation 
becomes markedly more tedious for 
both subject and observer. Another 
reason for choice of the fifth reading 
may be that Ar; distributions are usu- 
ally normal, whereas Ars, Ars, Ar, dis- 
tributions are often positively skewed. 
Distributions from measure A7g,..., 
Ar, usually show negative skewness. 
Neither are compelling reasons for the 
selections of any particular reading as 
Pn. The only reason that would be com- 
pelling is the demonstration that adap- 
tation measures based on a particular 
reading yield stronger relationships 
with other variables of interest (for 
example, outcome of treatment). Lack- 
ing information of this sort, it is neces- 
sary to assume that any reading subse- 
quent to the initial one offers as good a 
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TaBLeE 1. Comparison of intertrial correlations 
of two adaptation measures (Ar, 











Ady Ads Ady 
914 .796 .824 Ads 
Ar 664 117.768 =Ady 
Ars .770 .696 716 Ads 
Are .580 .638  .659 
Ars Ar, Ars 








basis for measuring adaptation as any 
other. It is possible, of course, tempo- 
rarily to settle on a particular reading 
to be employed by studying the inter- 
correlations of adaptation measures 
based on different choices of p, and 
selecting that reading having the high- 
est set of correlations. Presumably this 
is the measure with the highest loading 
of an adaptation factor, or may be 
taken as such until demonstrated other- 
wise.° 

The intercorrelations of Ago... , 
A¢; and Ars,..., Ars were determined 
using Oxtoby’s data. These are given 
in Table 1 with the A¢ intercorrela- 
tions in the upper-right half of the 
matrix and the Ar intercorrelations in 
the lower-left position. 

Inspection of the A¢ intercorrela- 
tions shows a simple structure with 
the correlations, with one exception, 
steadily declining from Ag; to Ado. 
One would conclude, considering only 
five readings, that the fifth offers the 
best basis for deriving adaptation meas- 
ures. When the Ar intercorrelations 
are considered, structure is ambiguous 
but aparently indicates the superiority 
of the third reading. The lack of sim- 


*At the present time combinations of more 
than two readings are not considered as a 
means of deriving adaptation measures, al- 
though possibilities here are clearly worth 
investigation. 


plicity in the pattern of correlations, 
however, suggests the intrusion of ex- 
traneous factors in Ar measures. This 
is also suggested by the lower coefhi- 
cients of Ar intercorrelations when 
compared to those of Ag measures. 
Without exception A¢ correlations 
exceed those of their corresponding 
Ar correlations. These observations 
must be considered quite tentative, of 
course, in view of the small number of 
cases (24) on which the correlations 
are based. 

An indication of the possible value 
of A¢ will be offered in connection 
with the matter of retest reliability. It 
has been pointed out that initial fre- 
quency of stuttering may be expected 
to vary considerably in a stutterer 
from one testing to another. This sug- 
gests the Ad measures should be supe- 
rior to Ar measures when evaluated in 
terms of test-retest correlations. 

A study of 14 adult stutterers was 
conducted. Two adaptation tests were 
given to each subject from one to eight 


‘days apart. Two different 180-word 


passages, comparable in word difficulty 
and frequency of initial sounds, were 
employed. One passage was adminis- 
tered by the speech therapist in charge 
of stutterers’ pretreatment investiga- 
tion, while the second passage was 
administered by a member of the psy- 
chology staff with whom the stutterers 
were not so familiar. 

The intention was to compare the 
performance of Ar and Ad measures 
under two test conditions which varied 
in time, prose material, and particularly 
in the affect-inducing character of the 
test situation. Under these conditions 
considerable intraindividual variation 
among the subjects in level of stuttering 
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frequency would be expected. The 
product moment correlation coefficient 
of initial frequencies was in fact .59. 
Turning to adaptation measures, the 
test-retest correlations for these 14 
subjects were A¢3=.69 and Ar;—.42. 
This does not constitute a demonstra- 
tion of adequate test-retest reliability, 
nor was this the intention. The results 
do suggest, however, that studies which 
control conditions in the interest of 
showing the reliability of adaptation 
measures may find Ag measures of 
greater value than Ar measures. 


Summary 


Relationships of adaptation measures 
to other variables of individual differ- 
ences among stutterers are lacking in 
the literature. It is conjectured that 
this may be due to inadequate measures 
of adaptation and accordingly several 
existing measures are examined in this 
light. A likely source of error appears 
to be the dependence of adaptation 
measures on frequency of stuttering 
on the initial reading of the adaptation 
experiment. 

An adaptation measure (A¢) inde- 
pendent of initial frequency is proposed 
that involves the angular transforma- 
tion of stuttering frequencies. Devia- 
tions from the expected level of stut- 
tering are used as adaptation scores. 
The expected scores are determined 
by reference to experimentally ob- 
tained linear regressions of the amount 
of stuttering of subsequent readings 
on the amount of stuttering of initial 
reading. Some examples of regression 
equations are offered. 

Under experimental conditions ac- 
centuating intraindividual variability 
of initial stuttering frequencies an Ag 


measure was shown to have a higher 
test-retest correlation than a measure 
based on simple relative decrease. 
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Pitch, Intensity, and Cleft 


DONALD A. HESS 


Hypernasality, breathiness, and hoarse- 
ness, according to McDonald and Baker 
(16), are voice quality characteristics 
which may be discernible in the voices 
of cleft palate speakers. Berry and Eis- 
enson (2) include harshness as a pos- 
sible vocal concomitant of cleft palate 
speech. Since these four terms have 
been used in some speech correction 
texts (6, 12, 28) to describe most of 
the types of perceptual voice-quality 
aberration, it would appear that the 
manner in which cleft palate voice may 
deviate from normal is highly diversi- 
fied. 

That a significant degree of nasality 
associated with cleft palate may re- 
sult from morphologic involvements 
in the speech structures, particularly 
the velum, is well recognized, although 
a number of functional factors have 
been discussed (13, 16). Hoarseness 
among cleft palate speakers has been 
explained as a result of chronically in- 
flamed and swollen tissues (2, p. 330; 
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Palate Voice Quality 


16), habitual glottal articulation (22, 
p- 165), and ventricular voice (2, p 
329). Breathiness in cleft palate pecth 
may be a vocal consequence of ‘faulty 
phonation’ (16), while harshness is be- 
lieved by other writers (2, p. 326) to be 
the result of lingual, pharyngeal, and 
laryngeal muscular imbalances asso- 
ciated with glottal articulation. Van 
Hattum (26) discovered that his cleft 
palate subjects used a greater percent- 
age of their available air supply during 
phonation and had lower vital capac- 
ities than his normal subjects. It seems 
reasonable that some degree of deviant 
voice quality may cede | from the man- 
ner in which cleft speakers learn to 
use their speaking voices. 

Vocal pitch and intensity have been 
found to relate functionally to prob- 
lems of breathiness (6), harshness (6), 
and hoarseness (19, 20, 30, 33). Opin- 
ion and evidence from certain writers 
(5, 29, 32) suggest that vocal weakness . 
and nasality are functionally related. 
Statements (7, 8, 9, 10, 11, 14, 18, 21) 
about the effect of pitch on the percep- 
tion of nasality are less conclusive, 
however, and one objective study of 
functionally nasal speakers (24) failed 
to discover a relationship between pitch 
level and perceived nasality. 

Kantner (14) stated that the percep- 
tion of nasality must remain the major 
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basis for discussing it. His observation 
probably holds true for other types of 
voice quality disturbance as well. With 
special reference to nasality, Johnson, 
Darley, and Spriestersbach (13, p. 71) 
have questioned the wisdom of making 
inferences about cleft palate voice from 
research findings based on subjects with 
functional etiologies. 

On the basis of the literature, it thus 
appeared that an objective investiga- 
tion of the effects of pitch and inten- 
sity on perceptible cleft palate voice 
quality was indicated. The problem, 
then, was to study the effects of vocal 
pitch and intensity on the degree to 
which nasality, breathiness, harshness, 
and hoarseness are perceived in the 
voices of cleft palate speakers. 


Procedure 


Preliminary Considerations and Stud- 
ies. The experimental plan included (a) 
instructing cleft palate speakers to vo- 
calize at different pitch and intensity 
levels, (b) recording their vocaliza- 
tions, and (c) obtaining auditor judg- 
ments with respect to nasality, breath- 
iness, harshness, and hoarseness of the 
recorded voices. 


Subject Delimitations. To avoid pos- 
sible extraneous variation undesirable to 
the purposes of the study, the subjects 
were limited to male cleft palate speak- 
ers who had undergone voice change. 
Other requirements for their selection 
were that they (a) possess borderline 
or higher intelligence, (b) pass a sweep- 
frequency screening, hearing test (20 
db) in the better ear for 500-4000 cps, 
and (c) perform successfully in the 
final test conditions. No criterion for 


degree of vocal defectiveness was used, 
nor were the subjects limited to any 
particular type of palatal restoration. 


Experimental Speech Sample. The 
disadvantages of a connected speech 
sample to obtain judgments of voice 
quality have been discussed by Sher- 
man (23). Moreover, it was necessary 
to use a speech sample which could be 
more easily controlled for conditions 
of pitch and intensity. For these rea- 
sons, isolated, sustained vowels were 
used. Six vowels—[i], [ul], [el, [ol, 
[wz], and [a]—were selected to repre- 
sent the high, intermediate, and low 
positions of the front and back vowels. 
In a pilot study it was found that voice 
quality could be most effectively 
judged when these vowels were sus- 
tained for about 2.5 seconds. 


Experimental Pitch and_ Intensity 
Levels. To ensure that the findings and 
conclusions of the study might be more 
directly applicable to clinical problems, 
it was felt that the test pitch levels for 
each subject should be based on_ his 
own habitual pitch. It was hoped that 
the present study could avoid the atyp- 
ical vocal results that the use of stand- 
ard pitch levels might involve. On the 
other hand, standard intensity levels 
which could be attained by all subjects 
were desired. Since Cobb and Lierle 
(4) had obseived that cleft palate 
speakers are dificient in ‘pitch varia- 
tion and intensity’ during connected 
speech, it was necessary to discover 
how capably the subjects could alter 
their vocal intensities at different pitch 
levels. 


In a pilot study 20 male cleft palate 
speakers were tested to determine their 
ability to phonate the six vowels 
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Ficure 1. Simplified diagram of the equipment used to test the subjects and obtain the recorded 
vowel phonations. Key: SR, spacing rod used to control mouth-to-microphone distance at six 
inches, VU, VU meter, M, microphone; ‘SPM, amplifier rigged to act as a sound pressure 
meter; TR, tape recorder; SG, signal generator; SHP, subject’s headphone; EHP, experimenter’s 


headphone. 


through ranges of 90 to 210 cps and 65 
to 100 db* SPL (re: .0002 dyne/cm?) 
at a mouth-to-microphone distance of 
6 in. It was found that a majority of the 
subjects could phonate the test vowels 
within a pitch range between habitual 
pitch (in cps) and a higher pitch level 
1.4 times the numerical cps value for 
habitual pitch. Within these pitch 
ranges they could phonate the vowels 
between 75 and 85 db SPL. These ex- 
tremes were thus selected as the experi- 
mental pitch and intensity levels. Since 
Steer and Tiffin (25) had found that 
the superior speakers they studied had 


*These intensity levels were measured with 
an H. H. Scott sound level meter, Type 
410-A, with C setting. 


an average range in syllabic power of 
approximately 20 db, with a standard 
deviation of 4.4 db, it was felt that the 
selected intensity levels would repre- 
sent relatively soft and loud voices. 


Estimate of Habitual Pitch. To de- 
termine whether he could reliably 
judge habitual pitch, the experimenter 
instructed 21 male cleft palate speakers 
to (a) repeat a standard test sentence 
five times and (b) discuss a subject like 
school, home town, or hobby for a 
short period of time. These two types 
of speech samples were tape recorded. 
From the recordings, the habitual pitch 
of each speaker was estimated by 
matching the experimenter’s subjective 








judgment of habitual pitch with a pure 
tone generated from a direct-reading 
signal generator. After a lapse of one 
week and with a new random order of 
speaker voices, a second estimate of 
habitual pitch was made from each 
type of sample. The resultant Pearson 
rs were .98 and .97, respectively, for 
the repeated estimates of habitual pitch 
from these two types of speech sam- 
ples; the ¢ tests of mean differences in 
estimate from judgment to rejudgment 
failed to show significant differences. 
It was thus felt that considerable con- 
fidence could be placed in his ability 
to estimate habitual pitch reliably. 

Overview of Final Recording and 
Judgment Procedure. Fifteen male cleft 
palate speakers who had undergone 
voice change were instructed to pho- 
nate each of six vowels—[i], [u], [e], 
[o], [ze], and [a]—for 2.5 sec at each 
of two pitch levels (habitual pitch in 
cps and a pitch level 1.4 times higher) 
and two intensity levels (75 db SPL 
and 85 db SPL). These vowel phona- 
tions were tape recorded. In all, 24 
vowel phonations were tape recorded 
for each subject. The 360 recorded 
vowel phonations were randomized for 
pitch levels, intensity levels, vowels, 
and subjects. They were played back 
for auditor judgments of severity of 
nasality, breathiness, harshness, and 
hoarseness. 

Equipment. Figure 1 is a simplified 
diagram of the equipment used to ob- 
tain the recordings of the test vowels 

phonated by the subjects at the dif- 
ferent experimental pitch and intensity 
levels. A steel spacing rod, % in. in 
diameter, was used to control mouth- 
to-microphone distance at 6 in. Inten- 
sity of phonation was controlled by an 


116 Journal of Speech and Hearing Research 


amplifier unit which had been cali- 
brated with an H. H. Scott sound level 
meter, Type 410-A (C setting), to en- 
able it to act as a sound pressure meter. 
For fine monitoring, a VU meter was 
used. To help the subjects maintain 
a proper pitch level while phonating 
the vowels, a Hewlett Packard audio 
oscillator, Model 200 CR, with monau- 
ral headphones for subject and experi- 
menter, was employed. Acting as a 
signal generator, it supplied pure tones 
that represented the test pitch levels 
for each subject. 


Because it was necessary to obtain 
the recordings of the subjects in their 
homes, a Pentron tape recorder, Model 
TR 4, with crystal microphone and 
tape speed of 7.5 in. per second, was 
used as the recording instrument. Its 
frequency response from 50 to 9500 
cps is advertised as +3 db, excluding 
the speaker. Frequency response tests 
indicated that the instrument was ade- 
quate for recording the subjects’ voices. 
However, the playback for judgments 
of voice quality required the use of a 
Magnecorder, Model PT6-A, operat- 
ing at a tape speed of 7.5 in. per sec- 
ond; a Magnecorder amplifier, Model 
PT6-J; and an Altec Lansing, Model 
604C, 15-in. coaxial speaker. 


Subjects. Fifteen subjects were used 
in the recording procedure. They 
ranged in age from 15 to 42 years, with 
a mean age of 19.1 years. Six subjects 
had postoperative palates, six were 
wearing prostheses, and three had neith- 
er type of palatal restoration although 
they were in the process of being fitted 
with prostheses. The subjects had un- 
dergone 0 to 200 hours of individual 
therapy; the average therapy time was 
80 hours. They ranged in habitual pitch 
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from 96 to 163 cps, with a mean habit- 
ual pitch of 133.1 cps. This average ha- 
bitual pitch is very similar to that 
reported by Fairbanks (6, pp. 168,169) 
for adult male superior speakers. How- 
ever, since all potential subjects could 
not master the experimental pitch and 
intensity conditions, and since a larger 
group of subjects had been found to 
possess a somewhat lower mean habitual 
pitch in a preliminary study, the aver- 
age habitual pitch of the 15 subjects 
finally employed does not necessarily 
represent that of adult male cleft palate 
speakers in general. 


Recording Procedure. The habitual 
pitch of each subject was determined 
by using the method described by Fair- 
banks (6, pp. 166, 167). Since some 
subjects were poor readers, a standard 
test sentence which was to be repeated 
over 21d over was substituted for the 
reading passage. The experimenter’s 
subjective estimates of habitual pitch 
for two trials were numerically trans- 
lated into cps by matching the vowel 
tones with pure tones from the direct- 
reading signal generator. The average 
of these two numerical values was re- 
garded as habitual pitch, while the 
higher pitch level was determined sim- 
ply by multiplying habitual pitch in 
cps by 1.4. In their repetitions of the 
test sentence the subjects had a tend- 
ency to drop their average pitch levels. 
When this was the case, the subject 
was trained further until the appropri- 
ate pitch level was discovered. 

Prior to the tape recording of each 
vowel phonation, each subject was 
trained to sustain the test vowel for 2.5 
sec at the required pitch and intensity 
level. Pitch level was controlled by re- 
quiring the subject to match his pitch 
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with the pure tone heard in his head- 
phone. The subject monitored intensity 
by keeping the VU meter steady if 
possible and within + 2 db of the re- 
quired level. 

All test vowels for both intensity 
levels were first phonated by the sub- 
jects at the higher pitch lev el, then at 
the lower pitch level. An important 
reason prompted this particular pro- 
cedure. It had been found that some 
subjects could be trained to phonate 
the vowels at a given pitch level only 
with considerable difficulty. Once they 
succeeded, they could continue to pro- 
duce that pitch on successive trials, 
even with the required changes in in- 
tensity. When they were required to 
change pitch on successive trials, how- 
ever, they had to be retrained, some- 
times with great consumption of time. 

For the 12 phonations within each 
pitch level, the six test vowels were ro- 
tated with the two intensity levels. The 
signal generator was set to emit a bare- 
ly audible continuous pure tone repre- 
senting the test pitch level. For training 
the subjects and recording each vowel 
phonation, the sound-pressure-meter 
amplifier was set at the appropriate in- 
tensity setting, and the tape recorder 
volume control was set at the appro- 
priate ‘best recording level.’* Tone con- 
trols of the sound pressure meter and 
tape recorder remained constantly at 
flat setting. 

There were three exceptions to this 
procedure: (a) Four subjects who had 
moderately severe to severe monaural 
hearing loss did not wear headphones 
but were trained to produce the re- 


“Best recording level’ was defined as that 
maximum gain setting which allowed all 
test vowels to be recorded at a given in- 
tensity level without distortion. 
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quired pitch level by the experimenter. 
(b) Two subjects could phonate [u] 
with an intensity no higher than 81 or 
82 db. In each case the tape-recorder 
gain control was advanced to permit a 
best recording. (c) Three subjects 
could not be initially trained to pro- 
duce the higher pitch level and were 
tested first through the habitual pitch 
level. 

The time required to train the sub- 
jects and obtain the recorded vowel 
phonations varied from about 40 min 
to 1 hr, 45 min. 

In the preliminary test of the fre- 
quency response of the recording mi- 
crophone, amplifier, and recording 
head, as well as in the playback 
through the Magnecorder PT-6, the 
greatest deviation (about 5 db) in an 
otherwise markedly flat overall fre- 
quency response was for 200 cps. With 
an average habitual pitch of 133.1 cps, 
the subjects phonated at their higher 
pitch level with fundamentals in this 
approximate energy area. Unfortu- 
nately a correction had not been made 
for this possible source of error. 


Preparation of Recordings. A total 
of 360 separate vowel phonations (24 
for each of 15 subjects) was recorded. 
The tapes for each were cut and coded 
for subjects, vowels, pitch levels, and 
intensity levels. They were then reas- 
sembled in a randomized order, with 
an additional provision that no two 
consecutive vowels would be from the 
same subject. Announcements for the 
order of presentation were inserted in 
the tape before each vowel phonation, 
and a 45-in. length of blank tape fol- 
lowed each to allow 6 sec for rating 
voice quality. 

To obtain sample vowel phonations 


for training the judges prior to each 
rating session, the experimenter selected 
those which, in his opinion, represented 
extremes in the continuum for each 
voice quality attribute being judged. 
Thus two extremely nasal, two ex- 
tremely breathy, two extremely harsh, 
and two extremely hoarse samples were 
chosen, along with several vowel pho- 
nations which appeared to be free from 
any disturbing voice quality character- 
istic. Finally, four vowel phonations 
were picked at random. For each voice 
quality characteristic to be rated, the 
appropriate vowel-phonation dubs were 
assembled in this order: (a) extremely 
defective, (b) normal or nondefective, 
(c) extremely defective, (d) normal or 
nondefective, and (e) randomly select- 
ed. Thus, there were five vowel-phona- 
tion dubs for training the judges prior 
to the actual judgments of each voice- 
quality attribute: nasality, breathiness, 
harshness, and hoarseness. The 20 
vowel-phonation dubs were placed at 
the beginning of reel I. 


Voice Quality Judgments. In the se- 
lection of the four voice quality charac- 
teristics to be judged, Fairbanks (6) 
was the authority followed. It was 
reasoned that clear definitions which 
include descriptions of the perceptual 
characteristics of each type of deviant 
voice quality must be formulated if 
meaningful judgments are expected. 
Therefore, in each rating session the 
voice quality attribute to be rated was 
discussed in detail with the judges, and 
operational definitions were given. For 
nasality, the critical characteristic to 
be judged was degree of perceived 
nasal resonance (6, p. 203). Denasality 
was defined as ‘no nasality,’ to be rated 
as zero on the scale. Breathiness was de- 
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TasLe 1. Summary of analysis of variance to test effects of pitch levels, intensity levels, and vowels 


on summed ratings of nasality. 








Source 





df ms - F ost 
Pitch Levels (P) 1 336.400 32.74 4.60 
Intensity Levels (I) 1 88.010 12.80 4.60 
Vowels (V) 5 162.932 10.81 2.37 
Subjects (S) 14 156.517 
PI 1 .100 .03 4.60 
PV 5 7.246 1.13 2.37 
PS 14 10.275 
IV 5 6.404 1.43 2.37 
IS 14 6.874 
Vs 70 15.075 
PIV 5 4.240 Fy 3! 2.37 
PIS 14 3.737 
PVS 70 6.407 
IVS 70 4.482 
PIVS 70 5.963 
Total 359 








*F ratios: msp/mspg; msz/msjg; msy/msyg; mspy/mspyg; MSpy/MSpys; Mszy/Mszyg; MSpry 


/msprys. 


TF 95 is the tabled value for the nearest given df. 


fined after West, Kennedy and Carr 
(31, p. 417) as ‘a vocal defect in which 
a portion of the expired air is not re- 
solved into tone.’ For this attribute, the 
judges were cautioned to listen for the 
audible rushing of air (6, p. 209) in 
making their judgments. Harshness was 
defined as ‘characterized by a noisy, 
raspy, unmusical tone. When low 
pitched it frequently is described as 
“guttural”; when high pitched the term 
“strident” sometimes is applied’ (6, p. 
213). Fairbanks’ (6, p. 216) definition 


of hoarseness also was used: ‘From both 
acoustical and causal points of view 
hoarseness combines the features of 
breathiness and harshness.’ 

Four graduate students with exten- 
sive experience in-judging voice qual- 
ity were employed as the judges. For 
the judgments of each voice quality 
characteristic, a seven-point equal-ap- 
pearing-intervals rating scale was em- 
ployed. The scale in each case ranged 
from zero, representing no voice qual- 
ity disorder, to six, representing an ex- 


TaBLe 2. Mean severity ratings for each voice quality characteristic at the various experimenta 


pitch and intensity levels. 








Mean Severity Ratings 





Experimental Conditions Nasality Breathiness Harshness Hoarseness 

Pitch Levels 

HP 3.84 3.04 2.49 1.71 

1.4 HP 3.36 2.96 2.18 1.38 
Intensity Levels 

75 db SPL 3.73 3.77 2.21 1.81 

85 db SPL 3.48 2.23 2.46 1.28 
(General Means) 3.60 3.00 2.33 1.55 
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TaBLe 3. Summary of analysis of variance to test effects of pitch levels, intensity levels, and vowels 
on summed ratings of breathiness. 











Source df ms F* F ost 
Pitch Levels (P) 1 11.380 .50 4.60 
Intensity Levels (I) 1 3397 .880 165.15 4.60 
Vowels (V) 5 139.558 12.84 2.37 
Subjects (S) 14 108.517 
PI 1 25.600 2.89 4.60 
PV 5 18.470 2.27 2.37 
PS 14 22.610 
IV 5 66.718 7.03 2.37 
IS 14 20.574 
VS 70 10.871 
PIV 5 15.000 2.14 2.37 
PIS 14 8.856 
PVS 70 8.125 
IVS 70 9.493 
PIVS 70 7.006 
Total 359 








*F ratios: msp/mspg; msj;/msjg; msy/msyg; mspz/msprg; mspy/Mspyg; msyzy/mMszyg; mMspry 
/msprys, 
tF 5 is the tabled value for the nearest given df. 


treme voice quality disorder. ror all each, one voice quality attribute was 
judgments, discrete numbers were re- judged, and 40 consecutive vowel 
quired. phonations were re-rated for purposes 

To obtain the judgments, the re- of a reliability check. Rest periods of 
corded phonations were played back 10 minutes were given after each reel 
in a sound-treated room at a standard of 120 vowel phonations. The order in 
level of comfortable loudness. Four which the voice quality characteristics 
separate rating sessions were held. In «were judged in the separate rating ses- 


TasLe 4. Summary of analysis of variance to test effects of pitch levels, intensity levels, and vowels 
on summed ratings of harshness. 











Source df ms F* F ost 
Pitch Levels (P) 1 138.130 7.09 4.60 
Intensity Levels (I) 1 99.220 4.93 4.60 
Vowels (V) 5 206 . 962 19.91 2.37 
Subjects (S) 14 77.288 
PI 1 .810 ll 4.60 
PV 5 15.524 2.20 2.37 
PS 14 19.494 
IV 5 9.560 1.43 2.37 
IS 14 20.119 
Vs 70 10.394 
PIV 5 5.640 75 2.37 
PIS 14 7.076 
PVS 70 7.058 
IVS 70 6.701 
PIVS 70 7.553 
Total 359 








*F ratios: msp/mspg; msj/msjg; msy/msyg; mspy/mspyg; Mspy/mspyg; msyy/msyyg;_ MSpry 
/msprys. 
TF .o5 is the tabled value for the nearest given df. 
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TaBLE 5. Summary of analysis of variance to test effects of pitch levels, intensity levels, and vowels 


on summed ratings of hoarseness. 








Source 





df ms a F ost 
Pitch Levels (P) 1 164.020 11.56 4.60 
Intensity Levels (I) 1 390.620 34.46 4.60 
Vowels (V) 5 60.310 11.51 2.37 
Subjects (8) 14 33.642 
PI 1 230 .04 4.60 
PV 5 4.332 1.39 2.37 
PS 14 14.186 
IV 5 22.878 5.30 2.37 
IS 14 11.334 
vs 70 5.239 
PIV 5 1.158 27 2.37 
PIS 14 6.159 
PVS 70 3.121 
IVS 70 4.315 
PIVS 70 4.236 
Total 359 








* F ratios: msp/mspg; msy;/msjg; msy/msyg; Mspy/Mspyg; MSpy/MSpyg; Mspy/Mszyg; MSpry 


/msprys- 


{F 95 is the tabled value for the nearest given df. 


sions was as follows: nasality, breath- 
iness, harshness, and hoarseness. With 
about two hours required for each rat- 
ing session, the total procedure for ob- 
taining auditor judgments entailed ap- 
proximately eight hours of time. 


Results 


Reliability of Criterion Measures. 
The criterion measure was the sum of 
the four ratings by four judges for 
the phonation of one vowel by one sub- 
ject. Reliability was evaluated for each 
set of 40 repeated measures (summed 
ratings) and the corresponding initial 
measures, by means of the Pearson r 
procedure along with the ¢ test of the 
difference between means. The ob- 
tained rs for nasality, breathiness, harsh- 
ness, and hoarseness measures were .74, 
.88, .85, and .75, respectively. None of 
the differences between means were 
significant at the 5% level. Although 
the reliability coefficients were some- 
what lower than would be desirable, it 


was assumed that the summed ratings 
would be sufficiently reliable for the 
purposes of this study. 


Method of Analysis. The obtained 
measures for each of the four voice 
qualities were analyzed by analysis of 
variance (15, pp. 254, 257) with four 
factors: pitch levels, intensity levels, 
vowels, and subjects in a 2 x 2x 6x 15 
classification. There were 360 measures 
for each analysis. 


Effect of Pitch and Intensity on Per- 
ceived Nasality. Pitch levels, intensity 
levels, and vowels* exerted a significant* 
effect on the degree of nasality per- 
ceived (see Table 1). The observed ef- 
fects of pitch and intensity are reported 
in Table 2. The differences between 
means indicate less nasality at the 
higher pitch level than at the habitual 


‘Specific effects of vowels on perception of 
voice quality are not reported in this article. 

‘In this and in subsequent analyses, the 5% 
level was required for significance. 
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pitch level and less nasality at the more 
intense level, 85 db SPL, than at the 
less intense level, 75 db SPL. 


Effect of Pitch and Intensity on Pex- 


ceived Breathiness. Degree of per- i 


ceived breathiness varied significantly 
with intensity levels and with vowels, 
the effect of pitch, however, was not 
significant (see Table 3). Perceived 
breathiness was markedly less severe at 
the more intense vocal level than at the 
less intense level (see Table 2). The 
significant vowel-intensity interaction 
is the result, apparently, of relatively 
small differences (see Table 2) for [u] 
and for [o] between mean severity 
measures obtained for the two inten- 
sity levels. 

Effect of Pitch and Intensity on Per- 
ceived Harshness. Severity of perceived 
harshness varied significantly as an ef- 
fect of pitch levels, of intensity levels, 
and of vowels (see Table 4). Harshness 
is on the average less severe for the 
higher pitch level than for habitual 
pitch level and less severe for the lower 
intensity level than for the higher in- 
tensity level (see Table 2). 

Effect of Pitch and Intensity on Per- 
ceived Hoarseness. Pitch levels, inten- 
sity levels, and vowels exerted a signifi- 
cant effect on the degree of hoarseness 
perceived by the judges (see Table 5). 
Also significant is the interaction of 
intensity levels with vowels. An exami- 
nation of the data for vowels revealed 
that, while the subjects were heard as 
considerably less hoarse at 85 db SPL 
while phonating the vowels [a], [z], 
and [e], this effect did not hold as 
markedly for the vowels [ul], [i], and 
[0]. There was thus consistency of 
direction of trend and the test of the 
main effect of intensity levels has a 


meaningful interpretation regardless of 
the significant interaction. The mean 
severity ratings for hoarseness were 
lower at the higher pitch level than at 
habitual pitch level and lower for the 
higher intensity level than for the 
lower intensity level (see Table 2). 

Summary of Results. Examination of 
Tables 1, 3, 4, and 5 shows that the 
interactions of pitch with intensity are 
nonsignificant for each of the four 
quality disorders studied. The effects of 
intensity on the perception of each of 
the four different voice quality attri- 
butes may thus be assumed to be con- 
sistent from one pitch level to another 
(and vice versa). 

In summary, at the higher pitch level 
(1.4 times the numerical value in cps 
for habitual pitch) nasality, harshness, 
and hoarseness are significantly less 
severe than at habitual pitch level. The 
pitch level effect for breathiness is non- 
significant. For nasality, breathiness, 
and hoarseness the severity is sig- 
nificantly less for the higher intensity 


‘than for the lower intensity. Harshness 


is significantly more severe at the high- 
er intensity. 


Discussion and Coziclusions 
Vowels as Speech Samples. Al- 
though, as previously explained, stand- 
ard intensity levels for the vowel pho- 
nations were desired in the present 
study, several experimental observa- 
tions led the writer to question their 
use with different vowels. As previously 
explained, two subjects could not pho- 
nate [u] at 85 db SPL. Many of the 
subjects phonated [i] only with diffi- 
culty at this intensity level, whereas 
they had little difficulty with the other 
vowels, particularly [a] and [z]. At 
the lower (75 db SPL) level, however, 
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they tended to exhibit greater vocal 
control in their phonation of [i] and 
[u], the very vowels with which they 
had experienced difficulty at the more 


intense phonation level. Black (3) re- - 


ported lower average intensities and 
peak power for the high vowels than 
for the low vowels in connected speech. 
The present observations on phonations 
of different vowels would suggest that 
high vowels may typically have lower 
intensities for the reason that they can- 
not be phonated at higher intensities 
with the ease that characterizes the 
phonation of low vowels, while the 
converse would hold true at lower in- 
tensities. Thus, it would seem reason- 
able that the use of standard intensity 
levels for the phonation of different 
vowels might not result in findings 
that are truly representative of some 
of the vowels as they are normally 
phonated in connected speech. An ob- 
jection also might be raised to the use 
of sustained vowels, since they cer- 
tainly are not typical of connected 
speech. 

Assuming negligible error in the 
manner by which sound pressure level 
was controlled in the present study, 
different vowels also apparently have 
different ‘best recording levels.’ In sim- 
ilar future studies a potential source of 
error might be avoided by correction 
for these differences, for example, by 
the use of a ‘best recording level’ for 
each particular test vowel. 


Pitch and Intensity. The present 
finding that nasality is perceived in 
lesser degree at higher pitch levels is 
in agreement with Froeschels (9). It 
would disagree with earlier opinions 
of Gray and Wise (J0) and Holmes 
(11), as well as the findings of Sher- 
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man and Goodwin (24), who, however, 
used functionally nasal subjects and a 
connected speech sample in their study. 

‘Weiss’s (29) finding of a moderate 
but significant correlation (—.57) be- 
tween oral sound pressure levels and 
judged severity of nasality led him to 
conclude that the louder a nasal voice 
is, the less nasal it is heard to be. The 
present findings tend to corroborate 
his conclusions, as well as statements 
by Williamson (32) and Cotton (5) in 
this regard. 

Fairbanks (6, p. 210) states that 
breathiness may be caused by the use 
of a pitch level lower than natural 
pitch. Since the natural pitch of the 
subjects in the present study was not 
determined, the present findings for the 
negligible effect of pitch on perceived 
breathiness cannot be used to evaluate 
this statement. The observation that 
breathiness is perceptibly reduced at 
greater vocal intensity would strongly 
corroborate Fairbanks’ (6, p. 212) 
statement that a very loud voice cannot 
at the same time be breathy. 

Harshness may result from ‘strain to 
produce adequate loudness, particularly 
at low pitches,’ according to Fairbanks 
(6, p. 214). Since the highest mean rat- 
ings for severity of harshness were as- 
signed to the louder vowel phonations 
at hibitual pitch, and since all the more 
loudly phonated vowels were perceived 
on the average as more harsh, the pres- 
ent results tend to confirm his state- 
ment. 

That hoarseness tends to be per- 
ceptually reduced at pitch levels higher 
than habitual pitch is apparently true 
for functionally hoarse speakers (33), 
persons whose hoarseness is related to 
organic conditions such as laryngeal 





contact ulcers (/9), and adult male 
cleft palate speakers. However, as dif- 
ferent authors (/, 9, 3/, p. 141) attest, 
the constant use of unnaturally high 
pitch levels also may involve undesir- 
able vocal consequences. The finding 
that the subjects were perceived as less 
hoarse while phonating more intensely 
is not what one might predict from 
statements by certain writers (19, 20, 
30). It would seem pertinent to point 
out that this particular experimental 
result might be fallacious if unreserv- 
edly applied to speaking behavior. Most 
persons, as occasions may demand, can 
vocalize with tonal clarity at much 
greater intensities than they normally 
use. It is, however, a recognized fact 
that continued use of loud voice may 
lead to serious vocal disorders. 

Implications for Therapy. The pres- 
ent results, along with the observation 
of Cobb and Lierle (4) that vocal 
weakness and restricted pitch variation 
characterized their cleft palate sub- 
jects, lead this writer to conclude that 
speech therapists may profitably experi- 
ment with judiciously elevated pitch 
and increased intensity as therapeutic 
adjuncts to enable the cleft palate 
speaker to reduce at least some of his 
perceptually disturbing voice quality, 
particularly nasality. 

Several cautions should be kept in 
mind in applying these conclusions to 
speech therapy. Findings reported by 
other investigators (/7, 27) suggest that 
the nasality perceived in cleft palate 
speech may be partly a function of 
articulatory proficiency. Moreover, 
Van Hattum (27) concluded that 


nasality judged from isolated, sustained 
vowels is an inadequate predictor of 
nasality judged from connected speech. 
Further, it is recognized that, to some 
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degree, the judges in the present study 
may have been required to rate a per- 
ceptual artificiality caused by the sub- 
jects’ prolongation of vowels at inten- 
sity levels which may not be typical of 
their normal production in connected 
speech. The procedure by which in- 
tensity was controlled may well have 
caused the subjects to depart from their 
normal manner of vocal production in 
their attempts to keep the VU meter 
needle steady. Finally, the number of 
experimental pitch and intensity levels 
was quite limited. 

Van Riper (28, p. 472) has described 
a procedure for teaching cleft palate 
speakers to use low pitches and low 
intensities while blending vowels as a 
preliminary to articulation therapy. 
However, he gives no rationale for this 
therapeutic technique. If improvement 
in voice quality is his intended goal, 
the present results might raise a ques- 
tion on the routine use of this clinical 
procedure. 


‘Summary 


The effects of pitch and intensity 
level on perceived cleft palate voice 
quality were studied. Fifteen adult male 
cleft palate subjects were instructed 
to phonate six test vowels at two pitch 
levels (habitual pitch and a_ higher 
pitch) and two intensity levels. Tape 
recordings of these vowel phonations 
were played back for auditor judg- 
ments of nasality, breathiness, harsh- 
ness, and hoarseness. 

According to the obtained results, 
nasality, harshness, and hoarseness are 
less severe at the higher pitch level than 
at the habitual pitch level. Breathiness 
is unaffected by pitch level. At the 
more intense level of phonation there 





a ta OO CO ™ 


. 


is less nasality, less breathiness, more 
harshness, and less hoarseness than at 
the less intense level. 
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Serious confusion exists as to the dif- 
ference between the sound pressure 
levels which characterize the normal 
threshold for a 1000-cps pure tone and 
the normal threshold for speech. Esti- 
mates as to the size of this discrepancy 
disagree by as much as 10 db. In con- 
sequence, there is need for clarification 
of the situation since the relationship 
between these two thresholds must be 
taken into account in establishing co- 
ordinated audiometric standards. 

The problem at hand is quickly ap- 
parent when one compares the norms 
as presently described by the American 
Standards for Audiometers with the 
findings of various studies reported in 
the literature. The American Standard 
Specification for Audiometers for Gen- 
eral Diagnostic Purposes (1) defines 
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the normal threshold sound pressure 
level for 1000 cps as 16.5 db (SPL re- 
ferred to 0.0002 microbar in a National 
Bureau of Standards Coupler 9-A), for 
the Western Electric type 705-A ear- 
phone. The American Standard Speci- 
fication for Speech Audiometers lists 
22 db (SPL referred to 0.0002 microbar 
in an American Standard type-1 Coup- 
ler) as the norm for speech. Indeed, 
the Specification explicitly states that, 
‘, .. the purpose of this requirement 


_[22 db SPL] is to set the 0 hearing 


loss for speech at a level about 6 db 
[authors’ italics] above the “normal” 
[sic] threshold for a pure tone of 1000 
cps as defined in American Standard 
Audiometers for General. Diagnostic 
Purposes ...’ (2, p. 9). The Specifica- 
tion further implies that the 6-db dif- 
ference is recognized as an approxima- 
tion which may require revision. 
Sporadic evidence (3, 4, 6, 9, 11) 
suggests that the difference in question 
is considerably greater than 6 db. Davis 
(6), for example, reports the average 
of the thresholds for 500, 1000, and 
2000 cps as 9 db (SPL) at the same time 
that he gives thresholds for various 
speech materials which range from 22 
db for spondees, through 26 db for 
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sentence material and digits, to 33 db 
for PB words as spoken by Rush 
Hughes. Lightfoot, Carhart, and Gaeth 
(11) observed a 16.5-db difference be- 
tween the threshold intensities for 1000 
cps and for spondee words exhibited 
by 31 otologically normal subjects. The 
most definitive finding to date, how- 
ever, is derived from the 1954 Wiscon- 
sin State Fair Survey (9). Here, the 
difference between the averages of the 
median threshold values for a 1000-cps 
pure tone and for spondee words, re- 
ported for all ears in the ‘selected 
normal group’ was 15 db. Discrepancies 
of relatively similar size characterize 
the results obtained in the Survey for 
samples (by decades) of the general 
population, although the data as pre- 


‘sented must be converted to sound 


pressure levels before the fact is fully 
apparent. Most recently, there is the in- 
direct evidence to be derived from 
Corso’s (4, 3) studies of normals’ 
thresholds for pure tones and for CID 
Auditory Test W-2. On relatively 
large groups of normal listeners, he ob- 
tained threshold SPLs of about 5 db for 
a 1000-cps pure tone and approximately 
19 db for the W-2 spondee recording. 
These data imply that the difference 
between thresholds is on the order of 
14 db. 

Recent evidence (3, 4, 7, 10), par- 
ticularly the work of Dadson and King 
(5) in England, has made it imperative 
to ask whether the present American 
norms for pure tones are correct. 
There is pressure in many quarters to 
alter these norms as a requisite step 
toward establishment of an interna- 
tional standard for pure-tone audio- 
meters. This situation, among other 
things, intensifies the need to define 
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the relation between thresholds for 
pure tones and for speech. 


It is now clearly apparent from evi- 
dence such as that obtained in the Wis- 
consin State Fair Survey (9) that 
young adults yield better thresholds 
than a less select group of ‘normal’ 
listeners derived from the population 
at large. Moreover, the present Amer- 
ican norm for speech (22 db SPL) ap- 
parently represents performance of se- 
lected young adults (the so-called ‘lab- 
oratory ear’). The American norm for 
pure tones, on the other hand, is based 
on the responses of a less restricted 
sampling (12). Here the ‘man on the 
street with “normal” hearing’ served 
as the referant. Thus, the two current 
American Standards seem to be in dis- 
agreement as to the category of nor- 
malcy on which they rest. If so, the 
situation must be rectified by relating 
both sets of audiometric specifications 
to the same criterion population. 

The first step toward such a unifica- 
tion of audiometric standards is to de- 
fine the difference in intensity between 
thresholds for pure tones and for 
speech. Particularly pressing in this re- 
spect, since the sound pressure level of 
a speech signal is defined in terms of 
the sound pressure level of an equiva- 
lent 1000-cps pure tone (2), is the need 
to know the relative acuity for speech 
and for 1000 cps. 

The present investigation was under- 
taken to explore the latter need. The 
experimental problem was to ascertain 
the physical discrepancy between 
thresholds for a 1000-cps pure tone and 
for speech. The specific procedure was 
to measure both thresholds in the same 
normal-hearing subjects. Recorded 
spondees were employed as the speech 
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Figure 1. Block diagram of apparatus used to measure both pure-tone and spondee thresholds. 
The core of the equipment is a commercially-available speech audiometer feeding a PDR-10 


earphone. 


material. Variables suspected of having 
a critical influence on the relationship 
between the two thresholds were ex- 
amined. These variables included so- 
phistication of the listener, effect of 
practice, method of threshold deter- 
mination, order of test administration, 
sex, ear, and familiarity with test vo- 
cabulary. Two groups of 10 subjects 
each were compared in examining the 
first two variables, while a third group 
of 96 subjects was used in a counter- 
balanced routine designed to study the 
remaining five variables. 


Apparatus 


Figure 1 shows a simplified block 
diagram of the experimental apparatus 


“used to measure both pure-tone and 


spondee thresholds. The core of the 
equipment consisted of a commercially- 
available speech audiometer (Grason- 
Stadler, type 162) feeding a PDR-10 
earphone mounted in an MX41/AR 
cushion. All spondee thresholds were 
obtained by playing either recorded 
list E or recorded list F of CID Audi- 
tory Test W-1 through this speech 
audiometer. 

Two separate pure-tone sources were 
fed through the speech audiometer 
to the same PDR-10 earphone. One 
source consisted of an audio-oscillator 
(General Radio, type 1304A) whose 
output was controlled by an electronic 
switch (Grason-Stadler, type 829). The 
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electronic switch was, in turn, trig- 
gered by an electronic timer (Grason- 
Stadler, type 471), to produce the de- 
sired temporal pattern of the pure-tone 
stimulus. Stability of both the oscillator 
frequency and the duration of each 
short tone passed by the switch was 
assured by continuous monitoring of 
both oscillator and switch outputs with 
a counter-timer (Berkeley, type 5500). 
This pure-tone source was used to 
measure the 1000-cps threshold by a 
rigidly defined psychophysical proce- 
dure described below. The second 
pure-tone source was an ordinary pure- 
tone diagnostic audiometer (ADC, 
model 53C). This source was used to 
measure the 1000-cps threshold by the 
‘clinical’ method which was studied. 

The electrical signal across the ear- 
phone was monitored with a vacuum- 
tube voltmeter (Hewlett-Packard, 
Model 400C) and a cathode-ray oscil- 
lograph (Dumont, Model 304-A). 

The acoustic output of the apparatus 
was calibrated by means of an ASA 
Type-1 coupler, calibrated condenser 
microphone (Western Electric, type 
640AA), cathode follower (ADC, 
D5153) and vacuum-tube voltmeter 
(Hewlett-Packard, Model 400-A). All 
thresholds, both pure-tone and speech, 
are thus subsequently reported as the 
sound pressure level, referred to 0.0002 
microbar, developed in an ASA Type-1 
coupler. 

The sound pressure level developed 
by the earphone at 1000 cps was meas- 
ured daily throughout the course of the 
experiment. The maximum variation in 
output over the five-month period dur- 
ing which subjects were tested was 1.4 
db, and the day-to-day variation ex- 
hibited no systematic trend over time. 


Pressure: levels developed at octave fre- 
quencies from 125 to 8000 cps were 
measured weekly over the five-month 
period, and demonstrated equivalent 
stability. 

Prior to the experiment, precautions 
were taken to insure that the speech 
audiometer conformed to all require- 
ments listed in the American Standard 
Specifications for Speech Audiometers 
(2) with particular reference to tests 
of overall acoustic fidelity. The ap- 
paratus equalled or exceeded all listed 
specifications. 


Threshold Measurement Techniques 


In order to evaluate the possible ef- 
fect of threshold measurement tech- 
nique on the relationship between pure- 
tone and speech threshold, two sepa- 
rate and distinct methods for defining 
each type of threshold were employed. 
One, the ‘up-and-down’ method (8) 
was characterized by a relatively rigid- 
ly-defined set of operations. The other, 
hereafter called the ‘clinical’ method, 
was as nearly as possible a duplication 
of the procedures commonly employed 
by audiologists within the context of 
the clinical situation. 


Up-and-Down Method. The up-and- 
down method was selected in prefer- 
ence to any of the three classical psy- 
chophysical metiods or their variants 
because it permitted a relatively iden- 
tical stimulus temporal pattern and 
measurement procedure for both pure- 
tone and speech thresholds. ae €s- 
sence of the method is that the inten- 
sity level of each successive stimulus is 
determined by the subject’s response to 
the previous stimulus. Jf the subject 
does not respond at a particular inten- 
sity, the next level of the stimulus is 
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increased by a predetermined fixed 
amount. On the other hand, if the suo- 
ject does respond at a particular inten- 
sity, the level of the next stimulus is 
decreased by the same amount. This 
simple rule, raising the intensity when 
the subject does not respond and low- 
ering it when he does respond, is fol- 
lowed throughout the course of a pre- 
determined number of stimulus presen- 
tations. The result is a series of response 
measures which oscillate about the 
threshold intensity. From these data 
the intensity level corresponding to 
50% response may be determined by 
suitable arithmetic computation (8). 

In the present experiment the in- 
tensity levels of successive stimuli were 
altered in 2-db steps over a series of 36 
presentations. For the spondee thresh- 
old, the block of successive stimuli con- 
sisted of the 36 spondee words recorded 
as lists E or F of CID Auditory Test 
W-1. For the 1000-cps threshold, the 
stimulus sequence was a train of 36 
short tones recurring at 6-sec intervals. 
Each short tone had a rise-decay time of 
50 msec, and a duration, at maximum 
amplitude, of 500 msec. The 6-sec rep- 
etition rate for the pure tones was se- 
lected to match the rate at which the 
spondee words recur on the W-1 re- 
cordings. Thus, insofar as thresholds 
obtained by the up-and-down method 
) are concerned, the experimental pro- 

cedure was virtually identical for both 
\_ pure-tone and spondee thresholds. The 
only difference was that for the spon- 
dee threshold each stimulus was a word 
which the subject repeated, either cor- 
rectly or incorrectly, while for the 
1000-cps threshold each stimulus was a 
pure tone to which the subject either 
did, or did not, respond. 


. authors to devise their own 


Clinical Method. Devising a satisfac- 
tory analogue to so-called ‘clinical 
audiometric technique’ proved to be 
one of the more difficult problems en- 
countered in this investigation. Numer- 
ous sources were consulted in an at- 
tempt to find some common denomina- 
tor epitomizing the basic operations to 
be followed in the clinical measurement 
of an auditory threshold. The relatively 
small number of even cursorily de- 
scribed procedures found in the litera- 
ture were characterized by a certain 
lack of agreement on some relevant par-: 
ticulars (for example, the number of 
times a stimulus is presented at a given 
level, whether a pure-tone stimulus is 
briefly turned on or briefly turned off, 
whether the threshold criterion is 100% 
response, 50% response, 0% response, 
or some intermediate value). 

Briefly, it seemed apparent that, in 
order to introduce some minimal de- 
gree of objectivity into the ‘clinical’ 
thresholds, it was necessary for the 
method. 
In so doing, the attempt was made to 
follow, as closely as possible, the coun- 
sel of experienced clinicians. In the 
final analysis the authors are able to 
justify the ‘clinical’ procedures ulti- 
mately employed on the sole basis that 
they seemed reasonable to them. 


The clinical procedure adopted for 
the measurement of the spondee thresh- 
old corresponded closely to the method 
described by Newby (13, pp. 119-120) 
for the W-1 records. Two or three 
words were initially presented at a level 
20 to 30 db above the estimated thresh- 
old level. Successive blocks of two or 
three words were then progressively 
attenuated in 10-db steps until a level 
was reached at which two consecutive 
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words were repeated incorrectly. At ascending runs, using 5-db steps, and 
this point, the tester simply ‘jumped presenting just one stimulus per step. 
around,’ in no set order, from level to No attempt was made to control the 
level in 2-db steps, presenting exactly duration of each tonal presentation 
four words per level. The spondee other than to instruct the tester to keep 
threshold was recorded as the lowest the presentation brief. In practice, the 
intensity at which the subject repeated tones were about 1 to 2 sec long. 

two out of four words correctly. In Instructions to each subject tested by} 
the event that the subject repeated either the clinical method or the up- 
three out of four words correctly at and-down method were as follows: 


dl 


= hee 


o* 2 


~ 
. 


one level, and only one out of four 
correctly at the next lower level, 
threshold was recorded as the intensity 
yielding three out of four responses. 
This problem seldom arose in actual 
practice. 


The clinical procedure employed toh 
measure the 1000-cps threshold closely 
resembled the ‘ascending’ technique 
described in the 1951 revision of the 
‘Manual for School Hearing Conserva- 
tion Programs’ (/4, p. 15) prepared by 
the Committee on Conservation of 
Hearing of the American Academy of 
Ophthalmology and Otolaryngology. 
By means of the interruptor switch on 
the clinical audiometer, a brief tone 
was first presented at a level 30 db 
above the estimated threshold in order 
to familiarize the subject with the test 
signal. The tester then descended in 
10-db steps, presenting one brief tone 
at each level, until the subject failed 
to respond. The tester next ascended in 
5-db steps, presenting one brief tone at 
each level, until a response occurred. 
He then decreased the intensity by 10 
or 15 db and again ascended in 5-db 
steps until another response occurred. 
This procedure was followed until the 
subject had responded three times at 
the same level. Threshold was thus de- 
fined as the lowest intensity at which 


The purpose of this study is to measure 
your threshold for tones and for words. 
Two test runs will be conducted, one 
using tones and one using words. 
During the tone test, you will hear a 
short burst of sound followed by inter- 
vals of silence. Each tone will be quite 
short. Some will be easy to hear. Others 
will be very faint. Whenever you hear 
one of these tones, no matter how faint 
it is, press the button. Since the tones will 
be very faint, it is necessary that you 
listen very carefully. 

When I test for your word threshold, 
you will hear a man’s voice saying two- 
syllable words, such as ‘wigwam,’ ‘there- 
fore’ or similar words. Each word will be 
preceded by the phrase ‘Say the word.’ 
It is only necessary for you to repeat the 
two-syllable word, not the phrase. Some 
of the words will be easy to hear. Others 
will be very faint. Whenever you hear 
a word, no matter how faint it is, repeat 
it out loud. You will have to listen care- 
fully since the words will be very faint. 
At the beginning of each word test, you 
will hear several sentences of identifying 
information which you do not need to 
repeat. 

There is no set order for the two test 
runs. You will be told at the beginning 
of each test whether it is to be a word 
test or a tone test. 

After I have placed the phones over 
your ears, it is extremely important that 
you do not move them in any way until 
the tests are completed. I will tell you 
when you can take them off. Any ques- 
tions? 


Subjects 


All subjects were audiometrically 


screened at a hearing level (hearing loss 


the subject responded three times in dial setting) of 10 db re USPHS norm 
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at octave intervals from 125 to 8000 
cps, and at 1500 and 3000 cps, in order 
to insure that each subject had relative- 
ly normal acuity in both ears. 

As already mentioned, three groups 
of subjects were employed. One group 
consisted of 10 sophisticated listeners. 
These individuals were selected from 
the staff and the graduate student popu- 
lation at the Audiological Laboratory. 
Each was highly experienced at the 
task of listening for very faint signals, 
and all were relatively familiar with 
the CID revised spondee word lists. 
They represented essentially ‘labora- 
tory ears.’ Five subjects were male, the 
other five, female. They ranged in age 
from 20 to 31 years. 

The second group was composed of 
10 undergraduate students selected on 
the basis of having had no previous ex- 
perience as listeners in auditory tests 
of any kind. Nine subjects were female, 
the other one, male. They ranged in 
age from 18 to 25 years. 

The third group included 96 sub- 
jects, three for each of 32 separate ex- 
perimental conditions. As was true of 
the second group, these 96 subjects 
were selected from the undergraduate 
population at Northwestern Univer- 
sity and met the requirement that they 
had not had prior experience with any 
auditory tests. No subject was accepted 
who reported any history of either ear 
pathology or excessive noise exposure. 


Prior Exposure to Spondee Words. 
Subjects differed in their familiarity 
with the spondee words. The 10 sophis- 
ticated listeners were well acquainted 
with these materials, while all other 
subjects initially were not. However, 
half of the 96 persons were exposed to 
the spondees immediately prior to the 


measurement of their thresholds for 
the words. Tc this end, these 48 sub- 
jects were given additional instructions 
as follows: 
Before the word test, I will read a series 
of 36 two-syllable words at a level which is 
easy for you to hear. You are to repeat 
each word. These words are the same 
words which you will later hear in the 
word test; however, they will be in a 
different order. Since the purpose of this 
initial reading of the words is to make 
you familiar with the words, please listen 
carefully. 


Procedure and Results 


Preliminary Study. An initial inves- 
tigation was conducted to determine 
the effects of sophistication in auditory 
tests, and of practice upon the rela- 
tionship between thresholds for pure 
tones and for speech. It was for this 
purpose that the first two groups of 
10 subjects were formed. 

Six thresholds were obtained for each 
subject in a single experimental session. 
Four of these thresholds were for pure 


- tones of 500, 1000, 1500 and 2000 cps, 


respectively. The last two were spon- 
dee thresholds obtained separately with 
W-1 list E and with W-1 list F. All 
six thresholds, pure-tone and _ speech, 
were measured by the up-and-down 
method previously described using 2- 
db steps of attenuation. Subsequent 
comparison of the two groups yielded 
information on the effect of familiarity 
with audiological procedures. 

Subjects in the sophisticated group 
underwent the foregoing procedure 
twice. The first run gave these subjects 
experience on the specific tasks in- 
volved. The results obtained during 
this session supplied the base of refer- 
ence against which to estimate the ef- 
fect of a practice session on thresholds 
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TaBLE 1. Mean pure-tone and spondee (W-1) threshold sound pressure levels (db re: 0.0002 microbar 
in ASA type-1 coupler) for sophisticated (N=10) and unsophisticated (N=10) listeners. 











Sophisticated Unsophisticated 
Threshold Group Group 
Session 1 Session 2 
Pure Tone 
500 eps 12.4 12.1 10.3 
1000 eps 6.8 7.3 5.7 
1500 eps 7.4 6.7 rey i 
2000 cps 6.6 bol 8.8 
Spondee 
First Test 17.0 16.8 24.3 
Second Test 16.7 16.6 21.4 
Difference 0.3 0.2 2.9 
Mean 16.8 16.7 22.8 
Mean Spondee Minus 1000 cps 10.0 9.4 Ly! 








measured by the up-and-down method. 

Table 1 summarizes the findings of 
the preliminary investigation. The 
mean pure-tone threshold sound-pres- 
sure levels obtained for the sophisti- 
cated group in the first experimental 
session (practice session) were essen- 
tially equivalent to those obtained in 
the second session (test session). This 
was true also for the mean spondee 
threshold. In addition, when the mean 
spondee thresholds for this group are 
considered, no difference is noted be- 
tween the first and second thresholds 
obtained in either the practice session 
or the test session. 

Further inspection of Table 1 re- 
veals that the pure-tone threshold 
sound-pressure levels are essentially 
equivalent for both sophisticated and 
unsophisticated listeners. When the 
spondee threshold sound-pressure lev- 
els for the two groups are compared, 
however, it becomes apparent that the 
sophisticated listeners yielded much 
lower thresholds than the unsophisti- 
cated listeners. Furthermore, unsophisti- 
cated listeners improved an average of 
about 3 db from the first threshold to 


the second, while sophisticated listeners 
did not. 


It seemed possible that this 3-db 
improvement between the first and sec- 
ond spondee thresholds in the unso- 
phisticated group could be due to the 
fact that the subjects gained knowledge 
of the test vocabulary during the first 
test. The same reasoning would ac- 
count for the lack of improvement on 
the second spondee threshold in the 
sophisticated group. In other words, the 
sophisticated subjects probably already 
had the optimum degree of familiarity 
with the words prior to the administra- 
tion of the first spondee test so that 
further exposure to test vocabulary had 
no demonstrable effect on threshold. If 
this interpretation is correct, one would 
expect the unsophisticated group to 
show further improvement on succes- 
sive retests with spondee words until 
they reached a terminal level of famil- 
iarity, equivalent to that of the sophisti- 
cated group. 

The foregoing results indicated the 
necessity of controlling, in the main 
experiment, both the general factor of 
subject sophistication in auditory tests 








and the specific factor of prior familiar- 
ity with the CID revised Harvard 
spondee words. The decision was made 
to restrict the main experiment to 
audiologically naive listeners because 
such persons are more similar to the 
relatively unsophisticated population 
encountered in clinical situations. More- 
over, it was deemed particularly im- 
portant, since such persons showed 
instability of threshold with successive 
exposures to test material, to explore 
more fully the effect of familiarity with 
test procedure upon threshold levels. 
Information on this point is essential if 
the variability of threshold due to re- 
testing is to be taken into account in 
specifying the criterion population for 
audiometric norms. 


Main Experiment. A five-factor de- 
sign was undertaken to assess the effects 
on thresholds for spondee words and 
for a 1000-cps tone of: (a) threshold 
measurement technique, (b) order of 
test administration, (c) sex, (d) ear, 
and (e) prior knowledge of test vocab- 
ulary. This investigation was under- 
taken with the third group of subjects, 
96 young adults having normal hearing 
but lacking prior experience with audi- 
tory tests. In order to control system- 
atically the five factors under con- 
sideration, the group was appropriately 
divided. Specifically, 48 subjects were 
tested by the up-and-down method, the 
other 48 by the clinical method. Within 
each subgroup, the spondee test was 
administered first to half of the sub- 
jects, the pure-tone test first to the rest. 
Half of the subjects were female, and 
the remainder, male. The right ear was 
tested 50% of the time, and the left ear 
the remaining 50%. The 36 spondees 
were read to half of the subjects prior 


134 Journal of Speech and Hearing Research 


TaBLE 2. Mean 1000-cps and spondee threshold 
sound pressure levels (db re: 0.0002 microbar in 
ASA type-1 coupler) for each method of measure- 
ment (N=96). 











Threshold Method 
Up-and-Down Clinical 
1000 eps 8.1 9.8 
Spondee : 21.6 21.7 








TaBLE 3. Mean 1000-cps and spondee threshold 
sound pressure levels (db re: 0.0002 microbar in 
ASA type-1 coupler) for each sex (N=96). 











Threshold Sex 
Male Female 
1000 eps 9.3 8.6 
Spondee 21.3 22.0 








Taste 4. Mean 1000-cps and spondee threshold 
sound pressure levels (db re: 0.0002 microbar in 
ASA type-1 coupler) for each ear (N=96). 











Threshold Ear 
Right Left 
1000 cps 8.3 9.6 
Spondee yi Me 21.5 








Taste 5. Méan 1000-cps and spondee threshold 
sound pressure levels (db re: 0.0002 microbar in 
ASA type-1 coupler) for each test order (N=96). 











Threshold Test Order 
Spondee First 1000 cps First 
1000 eps 9.7 8.2 
Spondee 22.3 21.0 








TaBLE 6. Mean 1000-cps and spondee threshold 
sound pressure levels (db re: 0.0002 microbar in 
ASA type-1 coupler) according to prior knowledge 
of spondee vocabulary (N=96). 











Knowledge of Spondee Vocabulary 
Threshold Without Prior With Prior 
Knowledge Knowledge 
1000 cps 8.6 9.3 
Spondee 23.0 20.3 
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to measurement of the speech thresh- 
old, while the remaining subjects were 
not given this opportunity to familiar- 
ize themselves with the test items. 

Each subject was seen in a single 
experimental session, during which two 
thresholds were measured, the thresh- 
old for a 1000-cps pure tone and the 
threshold for spondee words. The lat- 
ter was obtained with recorded list E 
of CID Auditory Test W-1. 

In order to visualize the effect of 
each factor separately, Tables 2 through 
6 were prepared to present the mean 
1000-cps and spondee thresholds ob- 
tained for the two categories of each 
factor in turn. Table 2, for example, 
illustrates only the effect produced on 
each threshold by varying the method 
of measurement. The mean spondee 
threshold for subjects in the clinical 
method group is only 0.1 db different 
from the mean threshold for subjects 
in the up-and-down method group. 
There is, however, a 1.7-db difference 
between the two methods for the 1000- 
cps thresholds. This difference is almost 
the exact order of magnitude to be 
expected in view of the difference in 
size of intensity steps used in the two 
methods. For the spondee threshold 
2-db steps were used for both the 
clinical and up-and-down methods. For 
the 1000-cps threshold, however, 2-db 
steps were used in the up-and-down 
method, but 5-db steps were used in 
the clinical method. A theoretical dif- 
ference of 1.5 db, in the direction of 
lower threshold intensity for the 
method involving 2-db steps would 
therefore be predicted. This difference 
derives from the fact that, when 5-db 
steps are used, the mean threshold is 
underestimated by 2.5 db, whereas, 


when 2-db steps are used, the mean 
threshold is underestimated by only 1.0 
db. The observed difference of 1.7 db 
appears to be in relatively good agree- 
ment with this theoretical expectation. 

Thus, for both the spondee threshold 
and the 1000-cps threshold, after allow- 
ance has been made for differences in 
the size of intensity steps used, there 
appears to be very little difference 
between results obtained by a clinical 
versus a laboratory procedure. 

Table 3 shows that there is no large 
or systematic effect of sex on the audi- 
tory acuity of young normals. Females 
average 0.7 db better for the 1000-cps 
threshold, but males are 0.7 db better 
for the spondee threshold. Both differ- 
ences are small. 

Table 4 reveals a slight advantage 
(1.3 db) for the right ear at 1000 cps. 
However, this effect is not maintained 
for the spondee threshold, so that evi- 
dence of a slight tendency for one ear 
to have greater acuity is lacking. 

Table 5 suggests that the effect of 
test order is not critical. The 1000-cps 
threshold SPL is slightly lower (1.5 
db) when it is obtained first, but the 
spondee threshold SPL is slightly high- 
er (1.3 db) when it is measured first. 

Table 6 summarizes the effect of 
prior knowledge of the spondee vocab- 
ulary on the spondee threshold. The 
threshold SPL is 2.7 db lower for sub- 
jects who were read the complete list 
of 36 words prior to the test than for 
subjects not given prior knowledge. 
The possibility of a sampling error, 
such that subjects assigned to the prior 
knowledge group had inherently keener 
threshold sensitivity than subjects in 
the group without prior knowledge, is 
minimized by the fact that these same 
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Taste 7. Summary of analysis of variance for 
1000-cps threshold data. 








Source df ms F 





Main Effects 


Method (a) 1 63.7 3.56 

Sex (b) 1 ih ae | 0.62 

Ear (c) 1 42.7 2.38 

Order (d) 1 i. || $43 

Prior Knowledge (e) 1 8.9 0.50 
Interactions 

exe 1 255.5 14.26* 
Within Sub-Classes 64 17.9 








*Significant beyond the 1%' level. All other 
interactions were nonsignificant and are not re- 
ported here. 


subjects did not yield a lewer threshold 
SPL for 1000 cps. The mean 1000-cps 
threshold SPL for the prior knowledge 
group was, in fact, 0.7 db higher than 
the mean 1000-cps threshold SPL for 
subjects without prior knowledge of 
the spondee vocabulary. 

This 2.7-db difference between 
spondee threshold SPL for subjects 
with and without prior knowledge of 
the spondee vocabulary was the largest 
effect observed over the five factors 
studied, and, as subsequent discussion 
shows, was the only effect that could 
be considered significant on the basis 
of statistical analysis. 

Tables 7 and 8 summarize analyses 


Taste 8. Summary of analysis of variance for 
spondee threshold data. 











Source af ms F 
Main Effects 
Method 1 0.0 0.00 
Sex A 13.3 1.21 
Ear ] 0.9 0.08 
Order 1 42.8 3.89 
Prior Knowledge 1 169.3 15.41* 
Interactions} 
Within Sub-Classes 64 110 








*Exceeds value required for the 1% level of 
confidence. 
tNone of the interactions are reported since 
none exceeds value required for the 1% level of 
confidence. 


of variance performed on the 1000-cps 
and spondee threshold data, respec- 
tively. Inspection of Table 7 shows 
that, for the 1000-cps threshold, none 
of the five factors studied appears to 
exert a significant effect, and there are 
no particular interactions worthy of 
note. Only one F ratio, that for the 
interaction between the ear tested and 
prior knowledge of spondee vocabu- 
lary (cxe), is significant beyond the 
1% level (F=7.08). Since it is difficult 
to conceive a logical basis for an actual 
interdependence between these two 
factors to exist, insofar as the threshold 
for 1000 cps is concerned, the authors 
choose to conclude that this F ratio 
represents that chance occurrence 
under the null hypothesis which will, in 
fact, happen 1% of the time in a large 
series of F ratios computed on the 
basis of pairs of samples drawn from 
identical populations. 

Table 8 shows a similar, relatively 
negative, set of results for the spondee 
threshold analysis. Again, only one F 
ratio is significant beyond the 1% level. 
Here however, it is a main effect, 
knowledge of test vocabulary, with a 
difference of 2.7 db between spondee 
thresholds for subjects with and with- 
out prior knowledge of test vocabulary, 
which is significant. This finding is 
interpreted as evidence that preliminary 
exposure of audiologically naive sub- 
jects to a list of spondee words lowers 
the measured threshold SPL slightly 
(2 to 3 db). All other F ratios obtained 
in the analyses of variance for spondee 
thresholds were nonsignificant. This 
situation, coupled with the totally neg- 
ative findings in the companion analysis 
of the data on acuity for 1000 cps 
leads to the conclusion that systematic 
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TaBLE 9. Mean 1000-cps threshold sound pressure levels (db re: 0.0002 microbar in ASA type-1 
coupler) for each method of measurement and mean spondee threshold sound pressure levels for each 
method of measurement and for subjects with and without prior knowledge of spondee test vocabulary. 











Method 1000-cps Threshold Spondee Thresholds 
Without With Combined 
Prior Knowledge Prior Knowledge 
Up-and-Down 8.1 22.3 20.9 21 
Clinical 9.8 23.6 19.7 21.7 
Combined 9.0 23.0 20.3 2 








effects due to method of testing, sex, 
ear, and order old meéasure- 
ment are either nonexistent or are so 
small that they are obscured by the 
uncontrollable variables in the investi- 
gation. In either event, the implication 
is clear. These four factors did not 
produce effects so large that the effects 
modified the measured threshhold levels 
substantially. Hence, the influence of 
sex, ear, and order of test may be disre- 
garded in examining the data at hand 
with the aim of assessing the relation- 
ship between acuity for 1000 cps and 
acuity for spondees. Technically, the 
same conclusion applies to the effect of 
method of test, but common sense 
argues that the size of the interval used 
in testing (which differed for pure 
tones in the two methods of test) 
should not be ignored completely. The 
rationale underlying this last statement 
and the bases for other general conclu- 


sions are highlighted by Tables 9 and 
10. 


Table 9 gives mean thresholds, sub- 
divided in terms of method of test and, 
for the spondee thresholds, the factor 
of prior knowledge. Table 10 reports 
the differences, for the same break- 
down of data, between the 1000-cps 
and the spondee thresholds. Appropri- 
ate combined values are also reported 
in both tables. The following conclu- 
sions seem pertinent and reasonable. 

First, as seen in Table 9, and as men- 
tioned earlier, the mean thresholds for 
1000 cps appear essentially equivalent 
when allowance is made for the fact 
that the clinical method used a 5-db 
step as contrasted to the 2-db step 
employed in the up-and-down method. 
Thus, it would appear that this thresh- 
old can serve as a stable point of ref- 
erence when the size of the test inter- 
val is specified. 

Second, the mean thresholds for 
spondaic words varied appreciably. As 
already pointed out, the variation with 
method of testing can be considered 


TaBLE 10. Difference between mean 1000-cps threshold sound pressure levels (db re: 0.0002 microbar 
in ASA type-1 coupler) and mean spondee threshold sound pressure levels for each method of measure- 
ment and for subjects with and without prior knowledge of test vocebulary. 








Method 





Prior Knowledge Combined 
Without With 
Up-and-Down 14.2 12.8 13.5 
Clinical 13.8 9.9 11.9 
Combined 14.0 11.3 12.6 
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random, but the variation due to famil- 
iarity with the spondees is systematic. 
The important point is the fact that 
the speech threshold is not a point of 
reference whose stability is comparable 
to that of the 1000-cps_ threshold. 
Therefore, the establishment of an 
audiometric norm for speech requires 
the designation of such additional con- 
ditions as: (a) the specific test material 
on which the norm is based and (b) 
the audiological sophistication of the 
subjects who are the reference group. 
Third, as the foregoing conclusions 
imply and as Table 10 illustrates, the 
difference between the thresholds for 
the 1000-cps pure tone and for the W-1 
spondees varies substantially. This vari- 
ation is primarily the result of the in- 
stability of the spondee threshold, al- 
though the change in the size of the 
interval used to measure acuity for 
1000 cps is thought to have exerted its 
influence as well. The outstanding 
point is that the observed differences 
range from 9.9 to 14.2 db, with the 


average for all conditions combined - 


being 12.6 db. These differences are all 
substantially greater than the 6-db value 
currently designated in the American 
Standards for Speech Audiometers (2). 
They are in reasonable agreement with 
several earlier investigations (3, 4, 6), 
and smaller than reported by other 
writers (9, 11). 


Discussion 


The practical implications of the 
present study have already been par- 
tially stated. These implications are: 
(a) that a multiplicity of conditions 
must be specified in order to stabilize 
the norm for speech audiometry, (b) 
that the difference between the norms 


for pure-tone audiometry and for 
speech is a function of the conditions 
chosen in specifying both but (c) that 
the difference between the norms for 
1000 cps and for speech should be 
designated as substantially more than 
6 db. 

The magnitude of the difference 
which is selected as specifying the 
standard relation between the threshold 
for 1000-cps pure tone and threshold 
for speech must be settled by arbitrary 
decision of the persons responsible for 
establishment of standards. The present 
study can assist these persons only to 
the degree that it helps to clarify the 
factors to be considered. 

To this end, it is instructive to ex- 
amine the differences between thresh- 
olds exhibited by the 20 subjects used 
in the preliminary study (see Table 
1) and to compare these results with 
the findings already reported for the 
96 subjects who participated in the 
main experiment. 

It probably represents approximately 
the limiting range to be encountered 
when subjects involved have ‘normal’ 
acuity. In other words, the evidence at 
hand leads one to the belief that, when 
the size of the test interval is constant 
for both measures, highly sophisticated 
listeners will detect a 1000-cps pure 
tone at a sound-pressure level about 
10 db weaker than the level at which 
they correctly repeat 50% of the W-1 
words. This difference may become as 
great as 17 db in consequence of com- 
plete unfamiliarity with the spondee 
test materials. Since the threshold for 
1000 cps is but little affected by audio- 
logical sophistication, the difference 
may fluctuate over a range of 7 or 8 
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db as a result of variation in the thresh- 
old for speech. 

Assuming that the foregoing analysis 
is correct, three choices are available 
in choosing a ‘standard’ difference to 
be incorporated in a revision of audio- 
metric norms. 


If highly sophisticated listeners are 
selected as the criterion population, a 
difference of about 10 db must be spec- 
ified. The practical consequence of 
such a choice will be that the naive 
listener (including many a_hard-of- 
hearing person) will yield initial thresh- 
olds which are several decibels poorer 
than his later ones will be, particularly 
if time is not taken to familiarize him 
with the test words prior to the initial 
test. 

If the fully naive listener is chosen 
as the standard, the reverse situation 
will exist. A difference of 15 or 16 db 
must now be specified, and any person 
having appreciable prior experience 
with the speech material will obtain 
thresholds which appear better by sev- 
eral decibels than they otherwise 
would. 

If a difference of intermediate value 
(12 or 13 db) is specified, this choice 
would imply that a moderately so- 
phisticated listener is the criterion, and 
it would keep the discrepancy small 
(circa 3 db) in the measurement of 
threshold for other tvpcs of listeners. 

This choice (that is, establishing the 
difference between the norm for 1000 
cps and the norm for speech at 12 to 13 
db) would seem to be most reasonable. 
It represents a ‘middle-of-the-road’ 
course if one is thinking in terms of 
spondee words as represented by W-1 
recording and also if one is contem- 
plating the array of evidence which 


studies other than the present one have 
supplied. Moreover, since the spondaic 
words have relatively high audibility, 
even audiologically sophisticated _lis- 
teners will exhibit a difference between 
modalities of at least 12 or 13 db when 
the speech threshold is determined 
with other types of material. 


Summary 


In order to determine the intensity 
difference between normal hearing for 
spondee words and normal hearing for 
a 1000-cps pure tone, both types of 
threshold were measured in 10 audio- 
logically sophisticated and 106 audio- 
logically naive listeners. 

The threshold for 1000 cps was 
found to be ielatively independent of 
the sophistication factor, but prior 
familiarity with the spondee test vo- 
cabulary exerted a significant influence 
on the spondee threshold. 

Results indicate that the intensity 
difference between normal hearing for 
a 1000-cps pure tone and for spondee 
words is considerably larger than 6 db. 
It ranged from 9.4 to 17.1 db depend- 
ing on variations in the type of sub- 
ject, the familiarity of speech materials, 
and aspects of the measurement pro- 
cedure. A difference of 13 db is ap- 
proximately medial. It represents a 
value which might properly be selected 
as the relationship to be specified for 
audiometric standards. 
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Some Factors Affecting the Spondee Threshold 
in Normal-Hearing Subjects 


TOM W. TILLMAN 


JAMES F. JERGER 


The effects of subject sophistication on 
auditory test results vary with the par- 
ticular stimulus being employed to 
assess auditory acuity. For example, 
Jerger and others (2) found that essen- 
tially equivalent pure-tone thresholds 
are obtained for highly sophisticated, 
highly practiced listeners and for lis- 
teners who have had no previous exper- 
ience in pure-tone audiometric examin- 
ations. However, the same experiment 
suggested that the spondee threshold 
SPL for sophisticated subjects, who 
were relatively familiar with the 
spondee vocabulary, was some 5 to 
7 db lower than that yielded by in- 
experienced subjects who were not 
familiar with the test vocabulary as 
such. It was found, moreover, that 
if spondee threshold measurements 
were repeated in these two groups, the 
second threshold SPL for the inexperi- 
enced subjects was approximately 3 db 
lower than the initial threshold SPL, 
while there was essentially no differ- 
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ence in the two thresholds for the 
sophisticated group. 

The above findings led these same 
investigators (2) to formulate a more 
extensive experiment utilizing 96 unso- 
phisticated normal-hearing subjects 
between the ages of 18 and 24 years. 
The design ot the experiment allowed 
the assessment of the effect of prior 
knowledge of the test vocabulary on 
the spondee threshold. Prior knowledge 
of test vocabulary was imparted to 
half of the subjects by: reading the 
entire list of spondee words to them 
immediately preceding the measure- 
ment of the spondee threshold. The 
mean spondee threshold SPL for the 
group given previous knowledge of the 
test vocabulary was approximately 3 
db better than that for the group not 
given such knowledge. 

Since the 3-db improvement ob- 
served in the preliminary experiment 
could have been due, in part, to knowl- 
edge of test vocabulary and, in part, 
to practice in the task of responding to 
words at threshold intensities, the pres- 
ent study was designed to isolate the 
effects of these two variables: (a) 
knowledge of spondee test vocabulary, 
and (b) practice in the task of respond- 
ing to faint spondee words. 
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Apparatus 


The core of the apparatus used to 
measure spondee and 1000-cps thresh- 
olds consisted of a commercially avail- 
able speech audiometer (Grason-Stad- 
ler, type 162) feeding a PDR-10 ear- 
phone mounted in an MX41/AR 
cushion. All spondee thresholds were 
obtained by playing portions of re- 
corded List E of CID Auditory Test 
W-1 through this speech audiometer. 

A separate pure-tone source, con- 
sisting of a commercially available clin- 
ical audiometer (Beltone, model 15-A), 
was fed through the speech audiometer 
to the same PDR-10 earphone. This 
portion of the apparatus was utilized in 
obtaining thresholds for a pure tone 
of 1000 cps. 

The electrical signal across the ear- 
phone was measured periodically with 
a vacuum-tube voltmeter (Hewlett- 
Packard, model 400A). The acoustic 
output of the apparatus was calibrated 
by means of an ASA ‘Type-1 coupler, 
calibrated condenser microphone 
(Western Electric, type 640AA), cath- 
ode follower (ADC, . D5153), and 
vacuum-tube voltmeter (Hewlett- 
Packard, model 400A). All speech and 
pure-tone thresholds reported below 
are, therefore, expressed as the sound 
pressure level, reference 0.0002 micro- 
bar, developed in an ASA Type-1 
coupler. 

The sound pressure levei developed 
by the earphone at octave frequencies 
from 125 to 8000 cps was measured 
before beginning the experimental test- 
ing and at three-day intervals through- 
out the course of the testing. The max- 
imum variation in output over the test- 
ing period at the test frequency of 
1000 cps was 0.5 db. This variation 


equalled or exceeded the maximum 
variation which occurred at any octave 
frequency from 250 to 4000 cps. How- 
ever, maximum variations in output of 
0.6 and 2.5 db occurred during the 
testing period at 125 and 8000 cps, re- 
spectively. Prior to the experiment, 
measurements demonstrated that the 
speech audiometer equalled or exceeded 
all specifications listed in the American 
Standard Specifications for Speech 
Audiometers (7). 


Subjects and Procedure 


Three groups of 10 subjects each 
were selected from the female student 
body at Northwestern University. The 
age range of each group was restricted 
to 18 to 24 years, and no subject was 
accepted who reported any history of 
ear pathology, excessive noise exposure 
or previous experience as a listener in 
auditory tests involving speech stimuli. 
All prospective subjects were screened 
audiometrically at a hearing level (hear- 
ing loss dial setting) of 10 db, refer- 
ence USPHS norm, at each octave in- 
terval from 125 to 8000 cps, as well as 
at 1500, 3000, and 6000 cps. No subject 
was accepted who failed to respond to 
any test frequency at the screening 
intensity in either ear. Only one ear of 
each subject was used in the collection 
of the data reported below. The right 
ear was tested in half of the subjects, 
the left ear in the remainder. Subjects 
accepted for the experiment were as- 
signed to the three experimental groups 
in a haphazard and _nonsystematic 
fashion. 

Each subject was seen in a single 
experimental session during which three 
thresholds were measured, the thresh- 
old for 1000-cps pure tone and two 
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separate thresholds for spondee words. 
The measurement of the threshold for 
the 1000-cps tone always followed the 
initial spondee threshold and preceded 
the second spondee threshold. 


Group A (Practice Only). One 
group of 10 subjects, hereafter referred 
to as Group A, was included to test for 
the effects of practice only. In this 
group the first spondee threshold was 
obtained by playing words 1 through 
18 of recorded List E of CID Auditory 
Test W-1. The second spondee thresh- 
old was obtained by playing words 19 
through 36 of the same recording. The 
effects of practice alone on the spondee 
threshold could then be evaluated by 
comparing the mean threshold obtained 
in the first test with the mean thresh- 
old obtained in the second test. 


Group B (Practice and Possible Prior 
Knowledge). A second group of 10 
subjects, hereafter referred to as Group 
B, was included to assess the combined 
effects of practice and possible knowl- 
edge of test vocabulary, prior to the 
second threshold. In this group, the 
first spondee threshold was obtained 
by playing words i through 18 of 
recorded List E of CID Auditory Test 
W-1. The second spondee threshold 
was then obtained by playing the same 
18 words again. The effects of practice 
plus possible knowledge of test vocab- 
ulary could then be evaluated by com- 
paring the magnitude of the two mean 
thresholds thus obtained. 


Group C (Practice and Definite Prior 
Knowledge). A third group of 10 sub- 
jects, Group C, was formed to evaluate 
the combined effects of practice and 
definite prior knowledge of test vocab- 
ulary on the spondee threshold. In this 


Tillman, Jerger: Spondee Threshold 143 


group, both spondee thresholds were 
obtained by playing words 1 through 
18 of recorded List E of CID Auditory 
Test W-1. In addition, each subject 
was given prior knowledge of the test 
vocabulary by hearing the first 18 
words of this list, in the manner de- 
scribed below, immediately preceding 
the measurement of the first spondee 
threshold. The effects of definite prior 
knowledge of test vocabulary could 
then be evaluated by comparing the 
mean initial threshold for this group 
with the mean initial thresholds ob- 
tained for Groups A and B. 

All spondee thresholds were ob- 
tained by the up-and-down method, 
described in detail by Jerger and others 
(2), utilizing 2-db steps of attenuation. 
This method was chosen in preference 
to a simpler and less time-corsuming 
clinical procedure because it allowed 
each threshold to be defined using ex- 
actly the same number of stimulus 
presentations. In other words, each 
spondee threshold was defined by pre- 
senting a series of 18 spondee words 
using the up-and-down method of 
threshold measurement. As noted ear- 
lier, these 18 words were always either 
words 1 through 18 or words 19 
through 36 of recorded List E of CID 
Auditory Test W-1. 

A clinical procedure utilizing 2-db 
steps of attenuation was employed to 
measure each subject’s threshold for a 
1000-cps pure tone. The procedure em- 
ployed was the ascending technique 
described in the 1951 revision of the 
‘Manual for School Hearing Conserva- 
tion Programs’ (3) prepared by the 
Committee on Conservation of Hearing 
of the American Academy of Oph- 
thalmology and Otolaryngology. 
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Taste 1. Mean spondee and 1000-cps threshold sound pressure levels (db re: 0.0002 microbar in 
ASA type-1 coupler) and their standard errors for three groups of 10 normal hearing subjects each: 
Group A, practice alone; Group B, practice and possible prior knowledge; Group C, practice and definite 


prior knowledge. 








First Second 
Spondee Threshold Spondee Threshold 1000 cps 











Group A 
Mean Threshold SPL 25.9 24.8 9.3 
Standard Error of Mean 1.0 1.3 1.3 
Group B 
Mean Threshold SPL 27.0 24.6 10.9 
Standard Error of Mean 1.2 1.1 1.6 
Group C 
Mean Threshold SPL 20.9 20.6 10.7 
Standard Error of Mean 1.0 0.6 1.2 
The instructions read to subjects in Results 


the present study preceding the exper- 
imental session were essentially iden- 
tical to those used by Jerger and others 
(2). The only changes were necessi- 
tated by the fact that the latter study 
employed only two test runs, while 
the present study used three test runs. 

In addition to the instructions noted 
above, which were read to all subjects, 
the following instructions were read 
to the prior knowledge group immedi- 
ately before the initial spondee thresh- 
old was measured. 


Before the initial word test, I will read a 
series of 18 two-syllable words at a level 
which is easy for you to hear. You are 
to repeat back each word. These words 
are the same ones that you will later hear 
in the two word tests; however, they will 
be in a different order. Since the purpose 
of this initial reading is to make you 
familiar with the words, please listen care- 
fully. 


The first 18 words of List E of CID 
Auditory Test W-1 were then pro- 
nounced in alphabetical order by mon- 
itored live voice at a level 50 db above 


the present American norm of 22 db 
SPL. 


Table 1 summarizes the experimental 
data. Group A, in which the first spon- 
dee threshold was obtained using the 
first 18 words of recorded List E and 
the second threshold using the second 
18 words of the same recorded list, 
showed an improvement in threshold 
acuity of 1.1 db from the first to the 
second test. This is a very slight dif- 
ference, and it would seem hazardous 
to conclude that it resulted from prac- 
tice in the task of responding to spon- 
dee words at threshold intensities. ‘This 
seems particularly true in view of the 
essentially equivalent thresholds for the 
two spondee threshold tests in Group 
C. In this latter group, the two thresh- 
olds were obtained using the first 
18 words of recorded list E after the 
subjects had been given previous ex- 
perience with the test words. The 0.3- 
db improvement in this group from the 
first to the second test could hardly 
indicate an important practice effect. 

In Group B an improvement of 2.4 
db occurred from the first to the sec- 
ond spondee threshold test. The two 
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thresholds in this group were obtained 
using the first 18 words of recorded 
List E. This group, it will be recalled, 
was included to assess the effect of 
possible knowledge of test vocabulary 
imparted to the subjects during the first 
spondee test as well as the effects of 
practice. In view of the relatively small 
change which occurred from the first 
to the second thresholds in Group A 
and the essentially equivalent thresholds 
obtained in Group C, it seems reason- 
able to conclude that the improvement 
noted in Group B was due to the 
knowledge of test vocabulary gained 
by the subjects as a result of the initial 
test. 

Comparison of both the first and 
second spondee thresholds obtained in 
Group C (definite prior knowledge of 
the test vocabulary) with those for 
Group A (no prior knowledge) re- 
veals that Group C yielded threshold 
SPLs 4 to 5 db lower than Group A. 
It seems reasonable to attribute this 
relatively large difference solely to the 
fact that Group C had been given prior 
knowledge of the test vocabulary. 


The above findings suggest that, at 
least insofar as short-term effects are 
concerned, practice in the task of re- 
sponding to spondee words at near- 
threshold intensities exerts no important 
influence on the spondee threshold in 
normal hearing subjects. Conversely, 
it would appear that previous knowl- 
edge of the spondee test vocabulary 
has an important effect. Specifically, 
threshold SPLs as much as 4 to 5 db 
lower will be obtained from subjects 
who are given previous knowledge of 
the test vocabulary than from subjects 
who are not given this prior knowledge. 


These findings have practical impli- 
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cations for the clinician as well as for 
those concerned with establishing a 
normative standard for the hearing of 
speech stimuli.;In the routine clinical 
procedures requiring successive meas- 
urements of the speech threshold using 
spondee words, the initial threshold 
SPL established may be 4 to 5 db higher 
than succeeding ones unless time is 
taken to familiarize the subject with 
the test vocabulary preceding the in- 
itial threshold test. 

In establishing an audiometric norm 
for speech, where spondee words are 
specified as the stimulus, one must ap- 
parently be concerned with the extent 
to which the criterion population is 
familiar with the test vocabulary. Such 
concern is necessary because the abso- 
lute value of the standard, as well as 
the relationship between hearing for 
pure tones and for speech, will be in- 
fluenced rather importantly by the 
degree to which the criterion popula- 
tion has prior familiarity with the test 
vocabulary. 


Summary 


The purpose of this experiment was 
to isolate the effects on the spondee 
threshold in normal-hearing subjects 
of (a) knowledge of spondee test vo- 
cabulary and (b) practice in the task 
of responding to spondee words at 
threshold intensities. 

Two spondee thresholds were ob- 
tained for each of 30 subjects under 
varying experimental conditions util- 
izing recorded List E of CID Auditory 
Test W-1. 

Results suggest two conclusions: 
(a) short-term practice in the task of 
responding to spondee words at thres- 
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hold intensities exerts no important in- 
fluence on the spondee threshold SPL 
in normal-hearing subjects; and (b) 
normal-hearing subjects given prior 
knowledge of the test vocabulary yield 
spondee threshold SPLs which are 4 
to 5 db lower than those yielded by 
subjects not given such knowledge. 
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Validity of the One-Frequency Screening 
Principle for Public School Children 


IRA M. VENTRY 


HAYES A. NEWBY 


The increasing acceptance afforded 
public school hearing conservation pro- 
grams has pointed up the need for audi- 
tory screening devices that are efficient, 
economical, and practical. A recent de- 
velopment in screening tests has been 
the single-frequency test reported by 
House and Glorig (2, 3, 4, 5). The ra- 
tionale for testing at one frequency 
(4000 cps) is that the loss at this fre- 
quency appears to be as great as or 
greater than the loss at any lower fre- 
quency. Davis, Hoople, and Parrack 
(1) have stated that an empirical rule 
about hearing loss seems to be that 
hearing is very seldom worse for any 
frequency important for speech than 
it is for 4000 cps. They also have said 
that the generality of this rule, at least 
for adults (present authors’ italics), has 
only very recently been realized. Thus 
it would appear that testing at 4000 
cps would detect hearing loss not only 
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at this frequency but also at lower fre- 
quencies. 

The purpose of this study was to in- 
vestigate whether the generality of the 
single-frequency principle can be ex- 
tended to include school children. Two 
null hypotheses were formulated for 
statistical testing of the principle in- 
volved: (a) there is no difference be- 
tween the mean threshold losses from 
500 through 8000 cps; (b) the mean 
threshold loss at 4000 cps is not dif- 
ferent from the mean threshold losses 
above and below 4000 cps. 


Procedure 


With a standardized five-frequency 
sweep test, 1517 first, third, and fifth 
grade children were screened individ- 
ually. The fail criterion for the sweep 
test was an inability to hear any one 
frequency in either ear at the screening 
levels of 20 db for 500 and 1000 cps 
and 15 db for 2000, 4000, and 8000 cps. 
Every fourth child failing the sweep 
test was given a pure-tone air-conduc- 
tion threshold test at the above five 
frequencies on the ear(s) failing the 
sweep test. The threshold tests were 
administered by an experienced audiol- 
ogist using a calibrated Maico E-2 au- 
diometer. All threshold tests were con- 
ducted in the school rooms that had 
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been utilized for the sweep testing. 
Standard audiometric procedures were 
employed in the determination of 
thresholds. Special care, of course, was 
taken to insure the honesty and reliabil- 
ity of a subject’s response. 

A total of 62 subjects (90 ears) re- 
ceived threshold tests. The distribution 
of males and females was approximate- 
ly equal with 32 males and 30 females. 
The mean age of all the children was 
105.7 months with a range of from 77 
to 139 months. Data were analyzed for 
each of the three grades separately and 
for all grades combined. 


Results 


Analysis of First Grade Threshold 
Data. Threshold tests were received by 
23 first grade children, 14 males and 
nine females. Their mean age was 84.4 
months with a range of from 77 to 93 
months. A total of 31 ears received 
threshold tests and the analysis is based 
on these 31 earst. McNemar’s (6) Case 
VIII Mixed Model analysis of variance 
was employed to test the hypothesis 
(a) above. The mean losses at the five 
frequencies tested are shown in Table 
1. Table 2 presents a summary of the 
analysis of variance, the results of 
which indicate that there are significant 


It might be argued that the use of two 
ears of one subject, at least in some cases, 
does not take into account the relationship 
between the ears of one subject. The same 
type of analysis, however, was employed 
utilizing only one ear of a subject (N sub- 
jects now equaled N ears). The statistical 
trend remained unchanged. For example, 
when the threshold results for all left ears 
were analyzed, the difference between 4000 
cps and the frequency showing the next 
highest mean threshold loss was highly sig- 
nificant (P<.001). This same level of sig- 
nificance was reached when all the right 
ear threshold data were analyzed. 


differences between the mean threshold 
losses from 500 through 8000 cps (F= 
3.55 and P < .01). 

The second hypothesis (b) was then 
tested by evaluating the difference be- 
tween the mean threshold loss at 4000 
cps and the threshold loss at 1000 cps. 
Rejection of the hypothesis of no dif- 
ference between 4000 cps and the next 
highest mean threshold loss (in this 
case 16.77 db at 1000 cps) indicates a 
significant difference between 4000 cps 
and each of the other frequencies. The 
difference between these two corre- 
lated means was tested by an F test?. 
The resultant F equaled 19.84 and in- 
dicated that the difference between the 
mean threshold loss at 1000 and at 
4000 cps is highly significant (P < 
001, df=1 and 30). The implications 
of these results are obvious and will be 
discussed later. 

Analysis of Third Grade Threshold 
Data, Exactly the same analysis as the 
above was made on the threshold re- 
sults of 22 third graders, 10 males and 
12 females. These subjects had a mean 
age of 108.0 months with a range of 
from 101 to 122 months. Threshold 
tests were received by 34 ears. Table 
1 presents the mean threshold losses at 
each of the five frequencies, and the 
results in Table 2 indicate that there is 
a highly significant difference (P< 
.001) between these mean losses. When 
the difference between 1000 and 4000 
cps was subjected to statistical test, the 
resultant F of 39.44 was found to be 
highly significant (P < .001, df = 1 
and 33). 


*The formula used in this case was F = 
(X.1 — X.2)?/ =(Dr — D)*/ R(R — 1), 
where 7 = 1 and m =R — 1 (6). 
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TaBLe 1. Mean threshold losses in db sensation levels for first, third, and fifth grade ears and for 


all ears combined. 








Frequencies (cps) 





Grade N 500 1000 2000 4000 8000 

1 31 16.29 16.77 14.35 21.77 15.97 
3 34 22.06 22.21 20.59 31.18 17.79 
5 25 15.40 14.00 14.00 23.60 11.20 
All 90 18.22 18.06 16.61 25.83 15.33 








Analysis of Fifth Grade Threshold 
Data. Threshold tests were received by 
17 fifth grade subjects, eight males and 
nine females (25 ears). The mean age 
of these children was 131.4 months 
with a range of from 128 to 139 
months. Table 1 presents the mean 
threshold losses from 500 through 8000 
cps and Table 2 indicates that the dif- 
ference between these mean losses is 
significant (P < .01). The F test com- 
paring the mean threshold losses at 500 
and 4000 cps resulted in a significant F 
of 6.47 (P < .05, df = 1 and 24). 


Analysis of Combined Threshold 
Data. For the 62 first, third, and fifth 


grade subjects combined, Table 1 again 
illustrates the mean losses at each of the 
frequencies tested. Table 2 shows that 
the differences between the mean losses 
is highly significant (P < .001). The F 
test of the difference between the two 
greatest losses (500 and 4000 cps) re- 
sulted in an F of 47.04 which, once 
more, is highly significant (P < .001, 
df = 1 and 89). 


Discussion 


The results of the statistical analyses 
indicate that, for the population of 
school children from which the ex- 
perimental group might be considered 


TaBLe 2. Analyses of variance for testing the differences between mean threshold losses for first, 
third, and fifth grade ears and for all ears combined. 











Source df ms F Pp 
First grade 
Frequency 4 245.21 3.55 01 
Ears 30 191.25 
Remainder 120 68.98 
Total 154 
Third grade 
Frequency a 861.18 11.40 .001 
Ears 33 800.59 
Remainder 132 75.56 
Total 169 
Fifth grade 
Frequency 4 553.20 4.53 .O1 
Ears 24 309.12 
Remainder 96 122.05 
Total 124 
All ears 
Frequency ay 1496.09 18.47 .001 
Ears 89 487 .87 
Remainder 356 81.02 


Total 
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a random sample, the concept upon 
which one-frequency screening rests is 
a valid one. The two null hypotheses 
formulated—(a) that there is no differ- 
ence between mean threshold losses 
and (b) that the loss at 4000 cps is not 
different from the loss at any other 
frequency tested—are rejected at sta- 
tistically significant levels. 

The validity of the one-frequency 
concept is given further support when 
another factor is analyzed. It will be 
noted (Table 1) that the frequencies 
showing the highest mean loss, exclud- 
ing 4000 cps, are 500 and 1000 cps. It 
is commonly recognized, however, 
that hearing at these frequencies may 
be affected by ambient noise levels 
found in a testing environment, partic- 
ularly in a school test situation. The 
same ambient noise levels have much 
less effect on hearing for the higher 
frequencies. The threshold tests con- 
ducted in this study took place, out of 
necessity, in an environment that was 
considerably noisier than the average 
test rooms employed in clinical audi- 
ometry. It is reasonable to assume that 
had the threshold tests been conducted 
in a quieter environment, even greater 
differences between the hearing losses 
at 500 and 1000 cps and the loss at 
4000 cps would have been obtained. 
Yet, even in an unfavorable testing 
environment, the loss at 4000 cps re- 
mained significantly greater than the 
mean losses at these two lower fre- 
quencies. 


One further comment might be made 
with respect to the previous analyses. 
The only frequencies included in this 
study were 500, 1000, 2000, 4000, and 
8000 cps. An important research prob- 
lem would be an evaluation of the gen- 


erality of the one-frequency principle 
by the inclusion of 1500, 3000, and 
6000 cps in the analysis of threshold 
results to determine whether the loss 
at 4000 cps is greater than the losses at 
these other frequencies as well. Evi- 
dence was obtained that the loss at 4000 
cps is greater than the loss at at least 
four of the traditional octave test fre- 
quencies. Whether the inclusion of 
other test frequencies, particularly the 
higher ones, would call for a revision of 
the principle remains to be explored. 
The importance of the need for this 
research is indicated by the growing 
emphasis on 3000 and 6000 cps as stand- 
ard audiometric test frequencies. 


Summary 


The single-frequency screening prin- 
ciple was evaluated for school children 
by conducting pure-tone air-conduc- 
tion threshold tests on every fourth 
subject failing a standardized five-fre- 
quency sweep test. A total of 1517 first, 
third, and fifth grade children received 
individual sweep tests; threshold re- 
sults were obtained on 90 ears (62 
subjects). The results of the threshold 
analyses indicated that there were sig- 
nificant differences between the mean 
threshold losses from 500 through 8000 
cps. Further statistical analyses pro- 
vided evidence that the mean threshold 
loss at 4000 cps is greater than the 
mean threshold loss at any of the other 
four frequencies tested. The validity of 
the single-frequency principle, at least 
under the conditions of this study, ap- 
pears to be established. One-frequency 
screening, however, needs to be further 
evaluated to determine the efficiency 
of this type of screening in detecting 
medically significant hearing loss in 
children. 
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Dysphasic Speech Responses 
to Visual Word Stimuli 


GERALD M. SIEGEL 


This study was designed to investigate 
the speech behavior of dysphasic sub- 
jects in terms of certain operationally 
defined aspects of words. These aspects, 
or word parameters, were so chosen 
that generalizations might be made 
concerning various theories of dys- 
phasia. 

Considerable research in dysphasia 
has indicated that there are character- 
istic patterns of difficulty in the speech 
of dysphasic patients. Investigators 
have attempted to analyze dysphasic 
speech performance in terms of as- 
pects of language and to identify those 
language parameters that typically cre- 
ate difficulty for the patient. Where 
such characteristic patterns of dys- 
phasic difficulty have been found, or 
postulated, attempts have been made 
to explain them in terms of a larger 
theoretical framework concerning the 
entire problem of dysphasia. 
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Review of Literature 

Recency of Learning. Freud (9, pp. 
87-89), Nielsen (17, p. 18) and Weis- 
enburg and McBride (26, p. 96) have 
described dysphasia as being charac- 
terized in part by specific impairment 
of relatively recently learned material 
in contrast to more remote learning. 
An outgrowth of this theory is the con- 
tention that bilingual dysphasics typi- 
cally retain greater ability in their 
mother tongue than in a more recently 
acquired language. 

Difficulty of the Task. It has been 
suggested also that a factor contrib- 
uting to dysphasic speech performance 
is the relative difficulty of the speech 
task. Thurston (23) found that on an 
oral spelling task dysphasic subjects had 
a disproportionate increase in number 
of errors as the words became more 
difficult. 


Part of Speec. Several writers have 
indicated that dysphasics have particu- 
lar difficulty in handling certain parts 
of speech. Freud (9, p. 34), Goldstein 
(11, p. 246), Weisenburg and McBride 
(26, p. 299), and Nielsen (17, p. 71) 
all describe a condition of ‘amnesic 
aphasia’ which is characterized by dis- 
turbance of the ability to use nouns 
and, in more severe cases, other parts 
of speech. Wepman (27, p. 236) de- 
scribes a condition of ‘anomia’ which 
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involves difficulty in the use of com- 
mon names. In the same source Wep- 
man (27, p. 153) describes the speech 
recovery process as one in which com- 
mon names for things (nouns) tend 
to be the first relearned patterns. 

Eisenson (8, p. 14) attributes the 
supposed increased difficulty with 
nouns to the high frequency of occur- 
rence of this particular part of speech 
in normal language rather than to any 
peculiarity of the nouns themselves. 
Brown and Schuell (3) and Schuell 
(21) have argued that dysphasics suffer 
generalized impairment of the ability 
voluntarily to select any specific word, 
regardless of the part of speech. Simi- 
larly, Olhoeft’s (18) dysphasic pa- 
tients failed to demonstrate, in ability 
to utter words, substantial differences 
among five parts of speech. 

Wepman and others (28) recently 
compared a sample of the verbal output 
of one ‘anomic’ patient with that of a 
normal subject and a derived normal 
model of spoken English. They re- 
ported that for their ‘anomic’ patient 
there was a tendency for the retention 
of frequently occurring words and for 
relatively greater loss of less frequently 
occurring words when the verbal out- 
put of their dysphasic patient was com- 
pared to that of their control subject 
and the normal model. The authors 
suggest that the dysphasic subject’s 
language repertoire may be related to a 
factor of ‘word generality,’ which is in 
turn related to ‘frequency of occur- 
rence.’ 


Specific Phonetic Factors. According 
to Head (14, p. 121) neither the part of 
speech nor the specific phonetic com- 
position should be considered an inde- 
pendent factor in dysphasia. Several 
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writers have, however, indicated that 
certain phonetic combinations may be 
more difficult than others for dys- 
phasics. Goldstein (J1, p. 43) and 
Huber (15) allude to specific phonetic 
difficulty while Anderson (J) and 
Schuell (20) have tended to discount 
the notion of any specific constellation 
of articulation errors among dysphasics. 
Schuell found articulation errors to be 
inconsistent and variable among dys- 
phasics. 


Levels of Word Abstraction. Ac- 
cording to Goldstein (11, 12) one of 
the sequelae of the brain pathology 
that precipitates dysphasia is impaired 
capacity for abstraction and the tend- 
ency to respond in a concrete manner 
and to concrete aspects of complex 
situations. This ‘behavioral dedifferen- 
tiation,’ as Goldstein calls it, is con- 
sidered a total personality change in- 
volving marked reorganization of the 
patient’s responses to abstract situations. 
Just as situations vary in inherent ab- 
stractness, words may conceivably vary 
with reference to level of abstraction. 
As words become increasingly ab- 
stract, they may become correspond- 
ingly more difficult for the dysphasic 
patient whose ability to handle abstract 
material is impaired. 

Eisenson (7), Wepman (27), and 
Brown (2) have not concurred with 
Goldstein as to the prevalence of im- 
pairment of ability to abstract among 
dysphasics. 

Evaluating the role of ‘word ab- 
straction level’ in dysphasic speech is 
complicated by the fact that there is 
no generally accepted operational defi- 
nition of this construct and, to the 
knowledge of the writer, no scale of 
level of abstraction of words. 
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Word Length. Attempts have been 
made to relate dysphasic speech errors 
to word length. Word length, however, 
is a complex variable that may be re- 
lated to any of a number of either 
auditory or visual criteria or even to 
so remote a factor as grammatical part 
of speech. Writers have variously re- 
ferred to the ‘small words’ of speech 
in terms of the number of letters and 
syllables or as connoting the so-called 
‘lesser’ parts of speech (such as articles, 
conjunctions, and prepositions). 

Goldstein (J1, p. 70) and Wepman 
(27, p. 153), apparently referring to 
' these lesser parts of speech, report that 
dysphasics have particular difficulty 
with ‘small words.’ Schuell (20) also 
found that there may be increased diffi- 
culty with small words but she related 
this to the auditory or visual vividness 
and prominence of words rather than 
to part of speech. 


It is an interesting fact that for both 
Schuell and Goldstein this factor of 
word lengtl. has meaning for a general 
theory of dysphasia. Goldstein ex- 
plains it in terms of his theory of im- 
pairment of abstraction. He indicated 
that these so-called small words do not 
normally present difficulty for dys- 
phasics when encountered in sentences 
since in this context they are concrete- 
ly attached to either preceding or fol- 
lowing words. When encountered in 
isolation, however, without contextual 
attachments, these words become quite 
abstract and therefore much more dif- 
ficult for the dysphasic to handle. 

Schuell, on the other hand, considers 
difficulty with small words in terms of 
her basic theory of the influence of 
impairment of ‘reauditorization’ on dys- 
phasic speech. She suggests that short 


words are more difficult for dysphasics 
than longer words because short words 
tend to be indistinct in connected 
speech and are consequently more dif- 
ficult to reauditorize. 


The Problem. Various authors have 
attempted to present a lawful explana- 
tion for the patterning of speech errors 
in dysphasic verbalization. Dysphasic 
speech behavior has been related to the 
factors of recency of learning, relative 
difficulty of the task, part of speech, 
phonetic composition, level of abstrac- 
tion, and word length. 


The present investigation was de- 
signed to study the relationship be- 
tween dysphasic speech behavior and 
three of these factors: part of speech, 
word length, and level of word ab- 
straction. Specifically, the following 
question was posed: Do dysphasic indi- 
viduals show consistent patterns of 
speech error in response to word stim- 
uli which are classified according to (a) 
part of speech, (b) level of abstraction, 
and (c) word length? 


Procedure 


Subjects. The subjects were 31 pa- 
tients diagnosed as dysphasic and were 
drawn from 11 hospitals and treatment 
centers. They ranged in age from 20 
to 66 years with a median age of 51.3 
years. Five subjects had a history of 
cortical trauma and 26 subjects had 
suffered a cerebro-vascular accident. 
Median time in school was 9.1 years 
with all but two subjects having com- 
pleted eight or more years. The group 
consisted of 24 males and seven females 
all of whom correctly produced no 
fewer than 10% and no more than 
90% of the 90 responses required. 
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Subjects were not screened for dys- 


arthria because it was felt that there 
is no satisfactory technique for dif- 
ferentiating a so-called dysarthric re- 
sponse from a dysphasic response. Simi- 
larly, such categories as ‘receptive, 
‘expressive,’ ‘nominal,’ and ‘amnesic’ 
were not considered since, as Weisen- 
berg and McBride (26, p. 47) have 
pointed out, ‘Nowhere in the aphasia 
literature is confusion more evident 
than in the classification of types of 
disorder.’ The reliability and stability 
of even such widely used classification 
systems as those of Eisenson (6) and 
Wepman (27) have been questioned 
(19). 


Stimulus Material. The stimulus ma- 
terial for this study consisted of a set 
of 90 words typed in uppercase letters 
one-fourth inch high on individual 4- 
by-6-inch white cards. The words were 
drawn from graded word lists (4, 10) 
and were designated as being no higher 
than fifth grade level. By controlling 
educational achievement of the sub- 
jects and grade level of the words some 
control was exercised also over the 
difficulty and recency of learning of 
the stimuli. 

Each of the 90 words was classified 
according to part of speech, word 
length, and level of abstraction. Within 
each parameter, subclassifications were 
defined so that there were three parts 
of speech (nouns, verbs, adjectives), 
two word lengths (short, long), and 
three levels of abstraction (low, medi- 
um, high). 

Part of speech was determined by 
reference to a standard dictionary 
(25). Word length was established by 
a criterion involving number of letters 
and number of syllables. ‘Long’ words 


contained two or more syllables and 
six or more letters. ‘Short’ words were 
single-syllable words of four or fewer 
letters. Since level of abstraction is a 
construct for which there are no ob- 
jective criteria, an operational proce- 
dure which is essentially an equal- 
appearing-interval scaling ‘method (24) 
was employed. This procedure is de- 
scribed by Darley, Sherman, and Siegel 
(5). 

The 90 words comprising the word 
list were chosen by selecting 30 words 
at each defined level of abstraction in 
such a way that for each abstraction 
level there was equal representation in 
terms of the other parameters: part of 
speech and word length. The word list 
was so devised that when the subclassi- 
fications of any one parameter were 
considered, the influence of the other 
two parameters was counterbalanced. 
Thus, there were 30 nouns, 30 verbs, 
and 30 adjectives; there were 45 short 
and 45 long words; there were 30 words 
at each of the three levels of abstrac- 
tion. 


Responses. The subjects were re- 
quired to utter each of the 90 stimulus 
words as the cards upon which they 
were typed were presented in a random 
order. Subjects were tested individually 
at the various participating hospitals 
and centers. All testing sessions were 
conducted solely by the experimenter 
and were tape recorded. The subjects’ 
responses were later replayed and eval- 
uated by the investigator as being either 
right or wrong. If response latency was 
greater than 15 seconds or if no re- 
sponse at all was elicited, the response 
was scored incorrect. Beyond this, re- 
sponses were scored wrong if they 
were judged by the investigator to be 
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Taste 1. Sum and mean number of errors for 
each subclassification of part of speech, word 
length, and word abstraction level for 31 dysphasice 
individuals. 








Classification Sum of Errors Mean of Errors 
Part of Speech 





Nouns 323 - 10.42 

Verbs 404 13.03 

Adjectives 405 13.06 
Length 

Short 478 15.42 

Long 654 21.09 
Abstraction Level 

Low 379 12.23 

Medium 356 11.48 

High 397 12.81 








significantly deviant from the accept- 
able utterance of the words. 

"One week after the original judging 
the experimenter rescored the tape- 
recorded responses of 10 of the subjects 
(900 discrete responses) and correlated 
these ratings with the original judg- 
ments for these 10 subjects. The for- 
mula for the correlation of a four-fold 
point surface (13) was applied and a 
phi coefficient of .90 was obtained. 


Results 


Statistical Analysis. Obtained differ- 
ences for each of the three parameters 
studied (level of abstraction, word 
length, part of speech) were evaluated 
by separate statistical analyses. For 
each analysis the criterion measure used 
was the number of wrong responses, 
and the level of significance selected 
was 5%. The balanced construction of 


TaBLe 2. Analysis of variance of the errors made 
by dysphasie subjects on nouns, verbs, and ad- 
jectives. 











Source df ms F 
Part of Speech (P) 2 71.48 9.46* 
Subjects (S) 30 142.58 
PS 60 7.55 
Total 92 








*F 01 (df = 2 and 30) = 8.77. 


the word list made possible the use of 
a treatments-by-subjects design (16) 
or, in the case of only two treatments, 
as in the word length analysis, the 
simple ¢ test for related samples. 

In Table 1 are presented the sum and 
the mean number of errors in each of 
the subclassifications for each parameter 
studied. 


Parts of Speech. Data reported in 
Table 1 indicate that of the three parts 
of speech, adjectives incurred the great- 
est number of errors while nouns in- 
curred the smallest number of errors. 
Subjects responded almost identically 
to verbs and to adjectives. The analysis 


TaB.e 3. Analysis of variance of the errors made 
by dysphasie subjects on words of low, medium, 
and high levels of abstraction. 











Source df ms F 
Abstraction (A) 2 13.63 7. 25* 
Subjects (S) 30 142.58 
AS 60 1.88 

Total 92 








*F os (df = 2 and 30) = 6.35. 


of variance results, summarized in 
Table 2, indicate significant differences 
among parts of speech. The difference 
necessary for significance at the 5% 
level is 1.37. Error scores for both verbs 
and adjectives are significantly greater 
than for nouns. The difference between 
verbs and adjectives is not significant. 
These findings do not support the 
hypothesis that the nominal aspects of 
speech offer selective difficulty for dys- 
phasics. 


Length. The mean number of errors 
for the short words was 15.42 com- 
pared to 21.09 for long words (see 
Table 1). The obtained t¢ of 4.35 is 
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highly significant (t1=3.65, df=30). 
It is evident that subjects had consider- 
ably more difficulty with long words 
than with shorter ones. 


Level of Abstraction. The analysis of 
variance results summarized in Table 3 
indicate significant differences among 
levels of abstraction. The difference 
necessary for significance at the 5% 
level is .72. The error scores (see Table 
1) for both high-abstraction-level 
words and low-abstraction-level words 
are significantly greater than for me- 
dium-abstraction-level words. The dif- 
ference between high and low levels of 
abstraction is not significant. 


Discussion 


In evaluating the results of this study 
it was clearly recognized that the par- 
ticular investigative approach used is 
only one of the several that might have 
been employed and that in the very 
process of attempting rigorous defini- 
tion of the stimuli and the subjects a 
certain amount of information was 
either ignored or lost and a certain 
amount of artificiality was introduced. 
The investigator was particularly aware 
of the fact that words in isolation do 
not necessarily affect dysphasic speech 
in the same way that words in context 


do. 


Part of Speech. The results obtained 
are in conflict with the traditional no- 
tion that dysphasics suffer a selective 
impairment of the ability to use nouns, 
that is, some sort of ‘anomic’ or ‘nom- 
inal’ impairment. These results tend to 
corroborate studies (3, 8, 18, 20, 21) 
that dispute the hypothesis of special- 
ized difficulty with one or another part 
of speech. The writer can offer no ex- 
planation, on the other hand, as to why 
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nouns should have incurred fewer 
errors than did the other parts of 
speech. It seems no more logical to 
suggest a special facility with nouns 
than to suggest some special difficulty 
with nouns. 


Word Length. There were signifi- 
cantly more errors on long than on 
short words. According to Goldstein 
(11), presentation of ‘small words’ in 
isolation tends to invoke more error 
responses from dysphasics than presen- 
tation of the same words in context 
because the lack of context increases 
the degree of abstraction of these 
words. In the present study, however, 
the abstraction levels of the two groups 
of words had been counterbalanced. 
If the salient feature of these small 
words is their increased abstraction, the 
process of counterbalancing abstraction 
level should have equated them in terms 
of their difficulty for dysphasics. The 
predominance of errors on long words 
indicates that this was not the case. It 
is true, of course, that the small words 
used were not the so-called ‘lesser’ parts 
of speech specified by Goldstein and 
that the procedures employed for 
specifying abstraction level do not nec- 
essarily relate to his concept of word 
abstraction. 


Schuell (20) suggests that small 
words may be selectively difficult in 
contextual speech because they tend to 
be slighted and are given scant auditory 
prominence. In the present study, 
words were presented in isolation 
rather than in context. Schuell also 
has suggested that with increasing 
length of stimulus unit more dysphasic 
errors occur because of increased dif- 
ficulty in reauditorizing. If this is so, 
one would expect that the longer words 








would be more difficult to reauditorize. 
Such an expectation is borne out in the 
present study. 


Level of Abstraction. The finding 
that high-abstraction-level words were 
more difficult for dysphasics than 
medium-abstraction-level words tends 
to support the Goldstein (1/, 12) hy- 
pothesis of impairment relative to de- 
gree of abstraction. ‘Iwo reversals were 
found, however. High- and _ low- 
abstraction-level words did not differ 
significantly and low-abstraction-level 
words were missed significantly more 
often than medium-level words. 

It was felt that these contradictory 
findings (and in some measure the 
findings of the part-of-speech analysis) 
might be related to some aspect of the 
stimulus words that had not been con- 
sidered in the initial formulation of 
the word list. One such possibility was 
suggested in an article by Wepman 
and others (28). Wepman reported 
that his dysphasic patient had a marked 
tendency toward the excessive use of 
words which occur frequently in writ- 
ten English usage and the under use of 
those which occur less frequently. The 
word list in the present study was 
selected without reference to frequen- 
cy of occurrence in language usage. 
In order to investigate this factor the 
frequency of occurrence of each of the 
90 words in the list was ascertained 
from the Thorndike and Lorge (22) 
word list. Forty-eight of the stimulus 
words were found to be within the 
2000 most frequently occurring words 
in written English usage. These were 
designated as comprising a ‘frequent’ 
level. The remaining 42 words were 
distributed among the third through 
the tenth thousand most frequently oc- 
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Taste 4. The number of frequent (N=48) and 
infrequent (N=42) words falling into each sub- 
classification of part of speech, length, and level 
of abstraction. 











Subclassifications Frequent Infrequent 
Words Words 

Part of Speech 

Nouns 19 tt 

Verbs 19 11 

Adjectives 10 20 
Length 

Short 28 17 

Long 20 25 
Abstraction Level 

Low 4 26 

Medium 20 10 

High 24 6 








curring words in written English usage 
and comprised an ‘infrequent’ level. 

Since the number of words in each 
category differed, the means could not 
be compared directly. For each sub- 
ject it was necessary to convert the 
number of errors made on frequent 
words to a proportion of the total num- 
ber of frequent words in the list (48 
words) and to convert the number of 
errors on infrequent words to a similar 
proportion using 42 as the basis for the 
proportion. These proportions were 
then treated as scores and a ¢ test for 
related samples was applied. The value 
of t, 4.87, was highly significant (¢,9:—= 
3.65, df=30). As suggested by Wep- 
man, there was greater impairment 
for words occurring less frequently in 
written language usage. 

Since frequency of occurrence had 
not been counterbalanced in the orig- 
inal word list, interpretation of the data 
becomes complicated. The apportion- 
ment of the two frequency levels in 
each of the other subclassifications is 
presented in Table 4. If frequency of 
occurrence is a basic factor in deter- 
mining dysphasic performance, it is ap- 
parent that this influence was unequally 
distributed. 


Siegel: 


Reviewing the part-of-speech analy- 
sis in the light of data for frequency of 
occurrence, it becomes evident that 
frequency of occurrence alone cannot 
account for the errors made on the 
three parts of speech. The subjects 
made more errors on verbs than nouns 
even though there were identical num- 
bers of frequent and infrequent words 
within these two parts of speech. 
Further, despite the greater number of 
frequent words among verbs in contrast 
to adjectives, the number of errors 
committed on these two parts of speech 
was not significantly different. 


It is typical of the English language 
that long words tend to be used less 
frequently than short ones (29). This 
fact is reflected in the word list used 
in the present study (Table 4). The 
relatively high incidence of errors made 
on both long and infrequent words may 
be the result of an interrelationship be- 
tween these two factors. It would be 
interesting to evaluate independently 
the influence on dysphasic speech of 
length and frequency of occurrence. 

The high and medium abstraction 
levels were approximately the same in 
terms of the distribution between the 
two frequency levels (Table 4); the 
effect of frequency may thus be con- 
sidered counterbalanced in this com- 
parison. The finding that medium-ab- 
straction-level words were more dif- 
ficult perhaps reflects a ‘true’ abstrac- 
tion function in which the higher the 
level the greater the difficulty of the 
words. 

The fact that there were 26 infre- 
quent words in the low abstraction 
level in contrast to only six such words 
in the high abstraction level may indi- 
cate that the influence of the abstrac- 
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tion level factor in this comparison was 
counteracted, thus accounting for the 
lack of a significant difference in er- 
rors on high- and low-abstraction-level 
words. 

In comparing words of low abstrac- 
tion level with those of medium ab- 
straction level, there were many more 
infrequent words in the low-abstrac- 
tion-level group and it is again conceiv- 
able that the factor of frequency was 
responsible for the greater number of 
errors on low-abstraction-level words 
than on medium-abstraction-level 
words. 

It may be noted that frequently used 
words tended to be judged as high in 
abstraction level. This is understandable 
since frequently occurring words must 
serve a number of language functions 
and, as such, have numerous referents. 
This characteristic is one of the basic 
criteria for designating a word as being 
high in abstraction level. 

It appears that frequency of occur- 
rence is related to, but not synonymous 
with, the other parameters defined in 
this investigation. Perhaps somewhere 
in this pattern of interrelationships re- 
sides a comprehensive principle of 
dysphasic language performance. 

The influence of frequency of oc- 
currence on the other parameters can- 
not be interpreted in this study; nor 
can the difference in errors obtained 
for the two frequency levels confident- 
ly be evaluated because of possible con- 
tamination by the original three param- 
eters which were not _ counterbal- 
anced with frequency. It does appear, 
however, that frequency of occurrence 
should be defined as a variable in study- 
ing dysphasic speech performance and 
that this factor deserves further investi- 
gation. 








Summary 


Speech behavior of 31 dysphasic sub- 
jects was investigated in terms of sev- 
eral word parameters: part of speech, 
level of abstraction, length, and fre- 
quency of occurrence in written Eng- 
lish usage. The obtained results indicate 
that dysphasics make more errors on 
adjectives than on either verbs or nouns 
(no evidence of ‘anomia’ was found), 
more on long words than on short; 
more on words of both high and low 
abstraction leve! than on those of me- 
dium level; and more on words occur- 
ring infrequently in the language than 
on those occurring frequently. 
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Scaling of Abstraction Level of Single Words 


FREDERIC L. DARLEY 
DOROTHY SHERMAN 


GERALD M. SIEGEL 


Development of a quantitative and op- 
erational definition of ‘word abstraction 
level,’ one aspect of Siegel’s (14) inves- 
tigation of dysphasic speech perform- 
ance, is the subject of this report. The 
Siegel investigation posed the question 
as to whether certain characteristic pat- 
terns of speech difficulty emerge when 
dysphasic individuals are required to 
respond to visual word stimuli which 
have been classified according to a 
number of parameters. One of the pa- 
rameters selected for study was the 
level of abstraction of word stimuli. 
Impairment of the ‘abstract attitude’ 
has been considered by some to be a 
primary consequence of brain damage 
and an essential aspect of the dysphasic 
syndrome. For Goldstein (9) the no- 
tion of impairment of the abstract atti- 
tude is an organizing concept which 
explains much of the dysphasic subject’s 
behavior. ‘In the concrete attitude we 
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are given over passively and bound 
to the immediate experience of unique 
objects or situations. Our thinking and 
acting are determined by the immedi- 
ate claims made by the particular aspect 
of the object or situation . . .,’ while in 
the abstract attitude ‘we transcend the 
immediately given specific aspect of 
sense impressions; we detach ourselves 
from the latter and consider the situa- 
tion from a conceptual point of view 
and react accordingly’ (9, p. 6). Gold- 
stein believes that one of the sequelae of 
the brain pathology that precipitates 
dysphasia is this impaired capacity for 
abstraction: 
Organic pathology in patients with brain 
disease disintegrates human behavior in 
such a way that the capacity for abstract 
behavior is impaired to greater or lesser 
degree in the patient. Thus, he becomes 
more or less reduced to a level of con- 
creteness . . . so that he can perform only 
those tasks which can be fulfilled in a 
concrete manner (10, p. 9). 
The behavioral disintegration (or 
‘dedifferentiation, as Goldstein calls 
it) that results from this impairment of 
abstraction is considered a total person- 
ality change in which the patient’s re- 
sponses to complex situations undergo 
marked reorganization, with the patient 
more typically functioning in the ‘con- 
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crete mode’ described. Goldstein indi- 
cates that there are various degrees of 
both abstract and concrete behavior 
and that 

... it must be emphasized that the process 

of dedifferentiation does not entirely de- 

stroy the ability to generate ideas and 
thought. What is affected and modified 
is the way in which thoughts and ideas are 
dealt with. Thoughts do arise, but can 
become effective only in a concrete way 

(ssp) 7) s 

Goldstein’s views have been accepted 
and supported by some writers (J, 2, 
16), while others have questioned his 
analysis on methodological grounds 
(11, 12, 18) or because of contrary 
clinical or research observations (3, J, 
Ie, 37). 

The degree of a subject’s capacity 
for abstraction may be reflected in 
given speech performances where de- 
grees of abstraction are involved. It 
would seem a reasonable extension of 
the Goldstein hypothesis to expect 
dysphasics to respond selectively to 
words as a function of their level of 
abstraction, if ‘level of abstraction’ 
could somehow be acceptably defined. 
In the syndrome Goldstein describes as 
‘amnesic aphasia’ the outstanding symp- 
tom reported is the inability to perform 
‘true’ naming (in contrast to ‘pseudo- 
naming,’ which involves merely the 
association of a word with an object). 
This inability is expressed primarily in 
the use of nouns. According to Gold- 
stein, the process of naming is quite 
an abstract one. Thus the difficulty 
which amnesic aphasics have in han- 
dling nouns may be thought to be di- 
rectly related to impairment of the 
ability to abstract. 

Eisenson (6, p. 2) has commented on 
dysphasic performance relative to word 
abstraction level: 


Aphasic difficulty is likely to increase in 
the realm of abstract words for which 
there is no basic objectivity. We cannot 
easily concretize such conceptual terms as 
beauty, humorous, tragic, incongruous, 
liberal, hope, love, atomic energy, etc. To 
explain these terms we would have to use 
other terms. Similarly, we would have 
great difficulty indicating objectively the 
meaning of such words as if, however, 
about. Most of us who develop language 
facility learn how to use these terms in 
conventional and appropriate ways. We 
reveal our meanings through them and 
understand other persons’ meanings as ex- 
pressed with these nonobjective terms. 
These, interestingly enough, are the very 
terms which cause the greatest difficulty, 
both in comprehension and production 
(spoken or written) for most aphasic 
patients. 


Evaluation of the role of ‘word ab- 
straction level’ in the speech of the 
dysphasic is complicated by the fact 
that there is no generally accepted 
operational definition of ‘word abstrac- 
tion level.” One may agree with Eisen- 
son that the words cited are ‘abstract,’ 
but he provides no cues as to their rel- 
ative level of abstraction nor as to how 
dysphasics respond to shades of ab- 
straction. Typically, where the prob- 
lem of abstraction has been considered, 
it has been related to performance skills 
other than speech. 

Flesch (7, p. 384) suggested a quan- 
titative means of specifying the abstrac- 
tion level of a verbal passage. In con- 
sidering this problem he wrote: 

Though the significance of the level of 

abstraction is widely recognized, no studies 

have been reported that attempt to esti- 
mate or measure it quantitatively. Discus- 
sions are usually limited to generalized 
distinctions between abstract and concrete 
words, illustrative examples, and such fig- 


urative devices as the semanticists’ ‘ab- 
straction ladder,’ 


Continuing, Flesch assumes that ‘level 
of abstractness can be ascertained by 
computing the ratio of certain parts 
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of speech to certain other parts of 
speech in written expression.’ He arbi- 
trarily designated certain parts of 
speech as ‘definite’ (concrete) and 
others as ‘non-definite.’ By computing 
the ratio of ‘definite’ to ‘non-definite’ 
words in a given passage, he obtained 
what he called his ‘abstraction index.’ 
Unfortunately no adequate rationale 
was provided for the designating of 
given words as either ‘definite’ or ‘non- 
definite.’ Though this procedure yields 
a quantitative value, there is no assur- 
ance that the criteria upon which it 
is based are valid. A more fruitful 
approach would seem to be first to 
define some operation by which the 
level of abstraction of individual words 
can be specified. 

Johnson (73) has written extensively 
concerning the process of abstraction 
and has devised an ‘abstraction ladder’ 
of the sort Flesch possibly had refer- 
ence to. Commenting on the diagram 
(13, p. 135) he presents, Johnson (13, 
p. 140) writes: 

It is to be clearly understood, of course, 

that the precise level of abstraction on 

which a statement—or a word—rests can- 
not be designated. Our diagram of the 
process of abstracting is not to be read like 

a thermometer. This is not to say that the 

notion of level of abstraction might not 

lend itself to investigation by means of 
fairly reliable rating scales. Such an in- 


vestigation has not yet been undertaken, 
but it doubtless will be in due time. 


So far as is known to the writers, no 
such scale had been formulated until 
the present investigation was under- 
taken. 


Procedure 


The method developed to provide a 
quantitative and operational definition 


of ‘word abstraction level’ consisted 
essentially of an equal-appearing-inter- 
val scaling procedure, after the method 
devised by Thurstone and Chave (15). 
A group of 35 judges, graduate students 
registered for courses in speech pathol- 
ogy or semantics, scaled the abstraction 
level of each of 572 words. The word 
list was comprised of 201 nouns, 201 
verbs, and 170 adjectives; words that 
are frequently used as more than one 
part of speech were excluded. All 
words were drawn from the Reading 
Vocabulary for the Primary Grades 
by Gates (8) and A Combined Word 
List by Buckingham and Dolch (4); 
the words selected were graded as no 
higher than fifth grade by at least one 
of these sources. 

Words were classified into two cate- 
gories with regard to length, ‘short’ 
and ‘long.’ Although the specification 
of what is ‘short’ or ‘long’ is necessarily 
arbitrary, a criterion involving both 
number of letters and number of sylla- 
bles was selected as most adequate; in 
the case of visual presentation these are. 
probably the most prominent aspects 
of length for the reader; furthermore, 
these aspects can be precisely specified. 
‘Short’ words, of which there were 
267, were defined as_ single-syllable 
words of four or fewer letters. ‘Long’ 
words, of which there were 305, were 
defined as words of two or more sylla- 
bles containing six or more letters. 

Each of the judges received a packet 
of mimeographed materials containing 
a set of instructions and the 572 words 
arranged in random order on 8” x 10” 
sheets. The instructions contained spe- 
cific directions regarding precedures 
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to be followed in making the ratings 
and a general discussion of the concept 
of word abstraction, which is here 
presented verbatim: 


In the present investigation we are at- 
tempting to determine the Jevel of abstrac- 
tion of a number of words. Specific words, 
whether nouns, verbs, or adjectives, vary 
in their level of abstraction. It will be your 
task to scale these words on the basis of 
your judgment of the level of abstraction 
of each of them. 

The level of abstraction of a word may 
depend on a number of considerations. 
Words tend to become more abstract as 
they become more general in their refer- 
ence or as they tend to leave out details. 
Thus, when comparing the abstraction 
level of two such words as ‘cow’ and 
‘animal,’ we might make our judgment on 
the basis of which of these two words 
contains the least specific information and 
tends to be a more general or a more in- 
clusive category. A similar judgment may 
be made for verbs, such as ‘walk’ and 
‘travel.’ 

Words also become increasingly abstract 
as they become further removed from a 
specific reference. In this respect those 
words are Jeast abstract that can be defined 


only by example, by demonstrating what - 


they are used to name or to stand for. The 
more abstract the word, the more difficult 
it is to find some means of ‘pointing at’ the 
meaning of the word. 


Undoubtedly there are other ways of 
explaining the concept of word abstrac- 
tion. The remarks above are merely in- 
tended to be indications of the kind of 
criteria that may be helpful in making this 
judgment. 

After a reading of the instructions 
and the answering of any questions 
raised about procedure, each judge 
worked individually and at his own 
rate. A five-point equal-appearing-in- 
tervals scale was employed, with J 
standing for least abstract and 5 for 
most abstract. Judging was completed 
in a single session which varied for 
individuals from one-half hour to an 
hour. 


Results 


Reliability. Median scale values of 
abstraction were computed for each of 
572 words from the responses of 35 
judges, employing a five-point equal- 
appearing-intervals scale. The semi- 
interquartile range, Q, which measures 
the dispersion of judgments, was ob- 
tained for each scale value. The mean 
Q value is 0.64. The range of Q values 
is from .20 to 1.24, and the standard 
deviation is .14. Only seven of the 572 
Q values are larger than 1.00 and only 
17 larger than .90. Thus, insofar as a 
measure of the dispersion of judgments 
is also a measure of reliability, the ob- 
tained scale values appear to be satisfac- 
torily reliable. 

Pearson’s correlation procedure was 
employed for further evaluation of the 
reliability of the obtained scale values. 
Two sets of mean scale values were 
computed for a sample of 90 words, the 
words which comprised the experi- 
mental material in the study reported 
by Siegel (14). Each of the two sets 
of scale values was computed from the 
responses of 17 judges, randomly as- 
signed for this purpose from the entire 
group of 35 judges (the responses of 
one randomly-selected judge were 
omitted to provide an equal number 
in each group). The obtained r for 
estimating the strength of relationship 
between the two sets of scale values 
is .91. The size of the obtained 7 indi- 
cates high agreement between the two 
sets of values and provides strong evi- 
dence that the method of scaling re- 
sults in satisfactorily reliable measures 
of abstraction of single words. 


Median Scale Values. The particular 
method of computation 


employed 
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Ficure 1. Distribution of level of abstraction 
scale values for 572 words. Values given 
along the abscissa are midpoints of degree of 
abstraction intervals. 


allows a possible range of obtained 
scale values from .50 to 4.50. The mean 
of the obtained scale values is 2.50, 
which is at the midpoint of the possible 
range. The obtained range is from .53 
to 4.43, and the standard deviation is 
95, 

The distribution of the obtained 
values, as shown in Figure 1, is essen- 
tially symmetrical and unimodal. Al- 
though there is no evidence of the true 
form of the distribution for the popula- 
tion from which the obtained 572 
values are a sample, it seems reasonable 
to expect it to be both unimodal and 
symmetrical. The method employed 
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for scaling the level of abstraction of 
words may thus be assumed to be use- 
ful for experimental purposes. 

An end effect, that is, a piling up of 
words at the extremes of the scale, was 
anticipated as a result typical of the 
method of equal-appearing intervals. 
Such was not the case, however. The 
observed peaking at the extreme lower 
end of the scale is to be accounted for 
by the disproportionately high number 
of nouns, which, in general, were 
judged to be low in level of abstraction. 

Figure 2 presents distributions, in 
percentages, of level of abstraction scale 
values for nouns, verbs, and adjectives, 
each classified as to length, ‘short’ and 
‘long’: 94 short nouns, 107 long nouns, 
98 short verbs, 103 long verbs, 75 short 
adjectives, aud 95 long adjectives. It 
can be seen that the distributions of 
nouns are markedly skewed, with a 
preponderance of low-scale-value 
words. The distributions of verbs are 
more nearly symmetrical, while the dis- 
tributions of adjectives are markedly 
skewed, with a preponderance of high- 
scale-value words. Average scale values 
increase in this order: short nouns, long 
nouns, short verbs, long: verbs, short 
adjectives, and long adjectives. The 
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Figure 2. Distributions, in percentages, of level of abstraction scale values for nouns, verbs, 
and adjectives, each classified as to length, ‘short’ and ‘long. Values given along the abscissas 


are midpoints of degree of abstraction intervals. 
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entire range of scale values is repre- 
sented among the short nouns, long 
nouns, and long adjectives; in the classi- 
fications of short verbs, long verbs, and 
short adjectives the entire range is rep- 
resented with the exception of the 
lowest level of abstraction. 

For the purposes of the investigation 
of which this scaling experiment was 
a part, three levels of abstraction were 
defined. Scale value limits were estab- 
lished to correspond to ‘low abstrac- 
tion’ (Level I), ‘medium abstraction’ 
(Level If), and ‘high abstraction’ 
(Level III). Of the total list of 572 
words, the 143, or 25%, which re- 
ceived the lowest median scale values 
were designated as constituting Level I; 
the 143, or 25%, which received the 
highest scale values were designated as 
constituting Level III; and another 143 
words, or the middle 25%, were desig- 
nated as constituting Level II. Specifi- 
cally, words falling between the scale 
values of .50 and 1.63 were considered 
of low abstraction level; words falling 
between values of 2.41 and 2.88, of 


TasL_E 1. Examples of words representing the 
three parts of speech at each of three levels of 
abstraction as scaled by 35 judges. 











Levels Nouns Verbs Adjectives 
I (‘low’) ant dig bald 
banana fasten chilly 
haystack hum sick 
oat jump skinny 
toe pour wooden 
II (‘medium’) business arrange athletic 
country borrow central 
family explore frisky 
game join hard 
news keep long 
III (‘high’) art am cute 
fame become dreadful 
heaven know honest 
liberty seem mere 
problem succeed nice 








medium abstraction level; and words 
falling between the values of 3.10 and 
4.50, of high abstraction level. Table 1 
presents examples of words’ represent- 
ing the three parts of speech at each 
of the three levels of abstraction. 

It should be acknowledged that the 
procedures outlined above constitute 
only one of several approaches to the 
problem of specifying abstraction level, 
and this approach may relate only im- 
perfectly to Goldstein’s concept of 
word abstraction. Goldstein, however, 
has provided no alternative method of 
quantifying his concept of word ab- 
straction. Further study of this param- 
eter of language appears warranted in 
order to refine research procedures in- 
tended to specify the language prob- 
lems of dysphasic subjects and to de- 
velop more comprehensive theories of 
the learning and decay of language. 


Summary 


Median scale values of level of ab- 


‘straction were obtained for each of 


572 words (nouns, verbs, adjectives) 
from the responses of 35 judges who 
rated the words on a five-point equal- 
appearing-intervals scale. The distri- 
bution of scale values was essentially 
unimodal and symmetrical. Statistical 
analysis provided evidence that the 
abstraction level of single words can 
be reliably scaled. It is suggested that 
this pool of words of scaled abstraction 
level (low, medium, high) provides a 
tool for use in future studies of the 
parameters of language which may op- 


*The total list of 572 words, median scale 
values, and Q values are on file at the Speech 
Clinic, University of Iowa. 
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erate differentially in language learning 
or in such impairments of language as 


dysphasia. 
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Estimates of Intraglottal Pressure During Phonation 


JAMES L. FLANAGAN 


In an earlier report (2) a rationale was 
given for making quasi-steady approxi- 
mations of the acoustic volume velocity 
in the human glottis during phonation. 
Within the conditions imposed by the 
assumptions, it was possible to deduce 
_ wave forms of glottal volume velocity 
from data on glottal area and subglottic 
pressure, and from steady-flow meas- 
urements of the nonlinear glottal re- 
sistance. Essentially the same arguments 
can be invoked to estimate the pressure 
variations within the glottal port. 

As in the earlier report (2), the esti- 
mates derive from measurements re- 
ported by van den Berg and others 
(3) on the resistance of the glottal 
orifice to steady flow. To fix terms, 
a schematic representation of the vocal 
folds of an adult male is shown in 
Figure 1. The folds are approximately 
3 mm in thickness, L. The glottal open- 
ing is approximated as a rectangular 
slit, 18 mm in length, /, and variable 
in width, w. The subglottic pressure 
(re atmospheric) is P,, and the pres- 
sures at the entry and outlet of the 
glottis are P; and Ps, respectively. The 
‘downstream’ (that is, pharyngeal) 
pressure is P,, and for cases of open 
articulation is, in comparison to P,, 
very nearly atmospheric. From empiri- 
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cal determinations, van den Berg and 
others give the resistance of the glottal 
orifice to steady flow as 


12yL 
lw? 





pQ 

+0.875 sp ; 49 
where Q is the volume velocity, » the 
kinematic coefficient of viscosity, and 
p the density of moist air at body tem- 
perature.’ Expression (1) is claimed to 
be accurate within 10% for: 0.1 mm 
= w = 2.0 mn; P, = 64 cm HO at 
small w; and Q = 2000 cm?/sec at 
large w. 


R=P,/Q= 


If the flow through the orifice were 
ideal, lossless and steady, the distribu- 
tion of pressure within the orifice 
would be as shown in Figure 2(a). The 
pressures at the entry and the outlet 
would be identical, and equal to the 
Bernoulli pressure, Pp. The Bernoulli 
pressure is the kinetic energy per-unit- 
volume possessed by the glottal flow, 
and is 


U2 Q?2 
P; = . => 32 ’ (2) 








where U is the linear velocity of flow, 
and A = lw is the glottal area. In actual 
measurements on plaster cast models of 
human larynges, however, van den 
Berg and others find that for 0.2 mm = 
w = 1.6 mm the distribution of pres- 


In c.g.s. units, the dimensions of R are 
dyne-sec/cm’*, 
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Ficure 1. Schematic representation of vocal 
folds of adult male. 
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sure in the glottis is approximately that 
shown in Figure 2(b). Here, 

P. = —0.5 Pz, (3) 
and (Ps —— P,) SES Ps, (4) 
for 3.6 mm? = A = 28.8 mm*. 

If equation (1) is solved for Q, and 
P, taken as a parameter, then substitu- 
tion into relation (2) yields the Ber- 
noulli pressure as a function of glottal 
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Ficure 2. Distribution of pressure within 
the glottal orifice; after van den Berg and 
others (3): (a) ideal, (b) actual. 
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area, with subglottic pressure the pa- 
rameter. Equation (1) is quadratic in Q, 
and its roots are real. Since the glottal 
volume flow is unipolar, and the glottal 
resistance nonnegative, the positive root 
is the appropriate value of Q. This 
arithmetic has been carried out for three 
different values of subglottic pressure, 
and the results are plotted in Figure 3. 
These data indicate the relationship be- 
tween the Bernoulli pressure and the 
glottal area for subglottic pressures of 
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Ficure 3. Variation of Bernoulli pressure, 
Pz, with glottal area. Computed formally 
from relations in the text. 


4, 8 and 16 cm H2O. The dashed por- 
tions of the curves extend beyond the 
range of areas for which (1) is valid 
within an accuracy of 10%. 

If the subglottic pressure and the 
time function of glottal area are given, 
it is possible, therefore, to deduce from 
(3) and (4) quasi-steady approxima- 
tions for the pressures at the entry and 
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Ficure 4. Single periods of the glottal area 
and pressure functions for an adult male 
phonating /q/ at low intensity. 


exit of the glottal port. To illustrate 
this point, glottal area functions and 
subglottic pressures for two conditions 
of phonation have been taken from the 
previously mentioned earlier report 
(2), and P; and P» have been computed 
for these data. Single periods of the 
area and pressure functions are shown 
for the two cases in Figures + and 5, 
respectively. 

In Figure 4 the data are for an adult 
male phonating the vowel /a/ at lim- 
inal intensity (Ps=4 cm H2Q), and at a 
fundamental frequency of 125 cps. For 
this condition complete closure of the 
glottis never occurs. Figure 5 is for the 
same person producing /a/ at the 
greatest intensity consistent with good 
vowel quality (P,=8 cm HQ), and 
at a fundamental frequency of 125 cps. 
The dashed portions of the P, and P» 
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curves extend beyond the range of 
areas for which (3) and (4) hold. In 
both figures the mean value of P, and 
P, is plotted and labeled P. 

The data show that P; can assume 
values ranging from the subglottic pres- 
sure at complete closure to negative 
values determined by the maximum 
opening and subglottic pressure. The 
pressure P. is always negative, and is 
essentially zero (or atmospheric) for 
complete closure. The fact that the 
pressures at the entry and outlet of the 
glottis can be of different sign suggests 
the possibility of a phase difference in 
the displacements of the lower and 
upper edges of the vocal folds. That 
is, the folds might be blown apart at 
the lower edges while the upper edges 
are being drawn together. The mode of 
vibration is, of course, dependent upon 
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Ficure 5. Single periods of the glottal area 
and pressure functions for an adult male | 
phonating /q/ at hig intensity. 
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Figure 6. Simple mechanical oscillator com- 
posed of a mass, M, a spring, K, and a viscous 
resistance, B. 


the distributed mass and stiffness con- 
stants of the folds, and often appears 
very complex. In the Bell Laboratories 
high-speed film of the vocal cords (1), 
however, a phase difference in dis- 
placement, such as just mentioned, 
seems to be quite evident for low- 
pitched voices in the chest register. At 
higher pitches, where the stiffness of 
the folds is greater and the mass less, 
the upper and lower edges appear to 
move in-phase. 


The pressure data in Figures 4 and 5 
also support observations made on the 
so-called breathy attack which also can 
be seen in the Bell Laboratories film. 
For this condition, vibration starts with 
the folds in the open phase and they are 
initially drawn together. If phonation 
is begun at a relatively large glottal 
area (for example, at T=4 or 5 msec 
in Figures 4 or 5) the mean pressure in 
the glottis is negative, and the initial 
displacement of the folds is to diminish 
glottal area. 

An extremely crude analogy can be 
drawn between the mechanical vibra- 
tion of the vocal folds and that of a 
simple mass-spring-dashpot  arrange- 
ment illustrated in Figure 6. If, under 
conditions of low pitch and medium 
intensity, the displacement of the folds 
is approximated very roughly as 


x(t) =k (1 —cos = t), (5) 
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where T is the fundamental vocal 
period, and k a scale factor, then the 
forcing function necessary to cause the 
simple mechanical circuit to execute 
this displacement is 


f(t) =kff (=2)2M —K]cos 7+ 


+(= =r) Bsin = —" t 1K}. (6) 


Wegel (4) has found that the natural 
frequency of vibration of the folds is 
less than the frequency produced dur- 
ing phonation. ‘Consequently, sustained 
vibration is mass-controlled and the 


Pe : : 
term Cos 4-t in (6) will predominate. 


Inspection of Figures 4 and 5 shows 
that the intraglottal pressure functions 
exhibit roughly this same time relation- 
ship to their displacement (or area) 
functions. 


Summary 


From data previously reported, esti- 
mates are made of the pressure varia- 
tions during phonation at the entry and 
outlet of the glottal port. The estimates 
derive from a previously advanced ra- 
tionale for making quasi-steady approx- 
imations of glottal volume velocities 
and pressures, and are founded upon 
steady-flow measurements of the non- 
linear glottal resistance. 

Calculations for two conditions of 
phonation illustrate that the entry and 
outlet pressures can be of opposite sign, 
and suggest the possibility of a phase 
difference in the motions of the lower 
and upper edges of the vocal folds. In 
the same vein, the associated function 
of mean intraglottal pressure supports 
the observations made on the so-called 
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breathy attack. A crude analogy is 
drawn between the force and displace- 
ment relationships for the vocal folds 
and those of a simple, mass-controlled 
oscillator. 
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Vowel Formant Measurements 


GORDON E. PETERSON 


The development of communication 
theory has aided considerably in de- 
fining the vocal mechanism as the pri- 
mary information source in speech (8). 
Speech is structured according to the 
capabilities and the limitations of this 
mechanism. It is easily demonstrated 
that the resulting acoustical signal does 
not have an entirely unique corre- 
spondence to the positions and move- 
ments of the speech mechanism. Fur- 
ther, the auditory perception of speech 
involves a certain distortion of the 
acoustical signal, in which every lis- 
tener applies his own aural analysis and 
linguistic prejudices. 


Phonetic Analysis 


Speech signals may be classified ac- 
cording to various types of direct vis- 
ual and instrumental observations on 
the physiological mechanism of speech 
production. Observations on the actions 
of the human larynx with a high speed 
motion picture camera (16), for ex- 
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ample, provide a basic means of cate- 
gorizing laryngeal structure and func- 
tion. In fact, the failure to start with 
technical descriptions of speech physi- 
ology and pathology is probably the 
primary source of the present extensive 
confusion about vocal qualities. In a 
related manner, speech signals may be 
classified according to acoustical simi- 
larities, as shown, for example, in sound 
spectrograms. 

It is also possible to classify signals 
according to listener judgments. Vari- 
ous psychophysical techniques may be 
employed in obtaining such judgments. 
Accordingly, certain speech signals 
may be found perceptually indistin- 
guishable or (phonetically) equivalent 
in certain attributes or characteristics. 
While physiological and acoustical 
measurements can be specified and de- 
fined, it is not easy to determine just 
what physical parameters control lis- 
tener judgments about speech signal 
similarities. 

Thus speech can be studied as a 
physical signal independent of language 
and its semantic properties. Such studies 
form the essential subject matter of 
phonetics and are basic to functional 
studies of spoken language. 

It is obvious from texts on physio- 
logical phonetics and from more recent 
publications on vocal tract analogs 
(1, 15) that many physiological di- 
mensions must be specified in order to 
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define a vowel sound accurately. In 
general, however, three major cate- 
gories have been employed to give a 
simplified description of vowels: (a) 
position of the hump of the tongue 
along the vocal tract, (b) maximum 
degree of constriction produced by 
humping the tongue, and (c) degree 
of lip rounding. The relationship of 
acoustical vowel loops to these and 
other parameters has been discussed 
previously (9). A sophisticated tech- 
nique for reducing vowel formations 
to three basic parameters has been 
described by Stevens and House (/4), 
and evaluated by House and Stevens 
(4). More recently it has become evi- 
dent to the present author that in some 
languages at least two additional prop- 
erties of vowels may be involved: (d) 
retrofiection, and (e) velarization of 
front vowels. 

Russell (12) was among the first to 
emphasize that there is not a unique 
correspondence between the _physio- 
logical formations of vowels and the 
acoustical positions of the first few 
formants. Russell, in fact, emphasized 
the difficulties which are involved 
obtaining an adequate physiological 
description or specification of individ- 
ual vowel phonemes. More recently, 
the work of Joos (7) and others has 
stressed the simplicity and convenience 
of describing vowels in acoustical 
terms. The physiological form of 
vowels is probably somewhat more 
fundamental and normally contains a 
greater amount of speaker information, 
but in a favorable acoustical environ- 
ment only a limited amount of informa- 
tion is lost in the transformation from 
physiological movements and positions 
to sound waves. The significant acous- 
tical parameters of vowels appear to 


be few in number, and the ease with 
which these parameters may be ob- 
served and measured in present com- 
munication technology strongly rec- 
ommends the acoustical form of the 
vowel signal for technical study and 
application. 

Acoustical Parameters 

Various types of multidimensional 
signal spaces can be conceived in which 
all of the physical information con- 
tained within an acoustical vowel wave 
may be precisely specified. In the 
acoustical waves of actual speech there 
is a relatively continuous structural 
change, so that any given vowel in an 
utterance would be represented by a 
trace or curve within such a multidi- 
mensional space. There is every reason 
to believe, however, that all of the 
physical details of vowels are not 
equally significant in their formation. 
‘Those parameters which are significant 
may vary, of course, from one language 
to another. To assume that all details of 
vowel stimuli are controlled equally 
well, however, is to assume a generation 
of information (in the mathematical 
sense) which is far greater than that 
of which the human organism is capa- 
ble. Much of the research of the past 
decade in acoustic phonetics has em- 
phasized the basic significance of the 
formant frequencies in vowels, and this 
paper is primarily concerned with 
problems in formant frequency meas- 
urement. 

The nature of the parameters of 
vowel production which are of pri- 
mary significance may be anticipated 
from a consideration of the vocal 
mechanism. The vocal tract is essen- 
tially an acoustical tube. In vowel for- 
mations the laryngeal generator is lo- 
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Figure 1. Narrow-band sound spectrograms and amplitude sections showing the fundamental 
voice frequency and the vowel formants. The upper picture is for a sustained vowel with 
rising fundamental frequency. The lower picture is for a vowel sequence with constant 


fundamental voice frequency. 


cated at one end and sound waves are 
radiated from the other. The generator 
in turn is driven from the trachea, 
which serves as a supply tube. Such 
an acoustical system may be considered 
as a sequence of coupled resonance 
elements, with resonance peaks in the 
frequency-response characteristic. The 
parameters which may have basic im- 
portance in such a system are the fun- 
damental driving frequency (and the 
input spectrum) of the generator and 
the frequencies, damping constants, 


and relative amplitudes of the reson- 
ances. Other parameters such as anti- 
resonance frequencies may become 
important when the system is made 
more complicated, as by the addition 
of the nasal passageways in vowel for- 
mation. 


Instrumentation 


With the current apparatus of speech 
analysis, the various parameters indi- 
cated above may be isolated without 
difficulty in marty vowel signals. How- 
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ever, there are certain circumstances 
under which the measurement of these 
parameters becomes difficult, and it is in 
these cases that particular attention to 
instrumentation is required. 

The manner in which the fundamen- 
tal frequency of the voice may be de- 
rived by means of the sound spectro- 
graph is shown in Figure 1. The dis- 
tinction between the voice frequency 
and the resonance or formant frequen- 
cies is indicated in this figure. These 
analyses were made with the narrow 
filter condition of the instrument. In 
the upper picture is shown a narrow 
filter spectrogram for a vowel with 
a continuously rising fundamental voice 
frequency. At the left of the picture, 
both the continuous narrow-band anal- 
ysis and the amplitude sections show 
that there were two major regions of 
resonance in the general area of 1000 
cps for this vowel. As voice frequency 








rises it becomes increasingly difficult in 
the figure to discriminate the two res- 
onances; and for the higher voice fre- 
quencies there is the suggestion of just 
one broad resonance for the vowel. 
For the lower picture of Figure | 
the fundamental frequency of the voice 
was held as constant as possible and the 
vocal cavities were altered to produce 
a sequence of two successive vowel 
positions. It will be noted that the har- 
monics remain essentially horizontal 
and that the resonance frequencies or 
formants vary according to the vowel. 
When a relatively wide filter is em- 
ployed in the analysis, the picture 
which results is illustrated in the upper 
spectrogram of Figure 2. The harmon- 
ics have now been combined to form 
broad bar patterns. This spectrogram 
is for the digits “one, two, three.’ Hug- 
gins (5) has modified a sound spectro- 
graph to include an analyzing filter 
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Ficure 2. Broad-band sound spectrogram and resonogram of the same utterance. 
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Ficure 3. Sound spectrogram of an utterance anaylzed with the conventional wide filter and 
with two other tuned-circuit characteristics as shown in Figure 4. 


which operates on a phase principle. 
Under certain circumstances such a 
system will display a formant with a 
relatively narrow line. Essentially, the 
lines correspond to the peaks of the 
envelopes of the amplitude sections 
shown in Figure 1. The instrument em- 
ployed in marking the upper patterns 
of Figure 2 was modified to include 
such a phase filter, and an analysis of 
the same utterance shown in the upper 
spectrogram is shown with a peak or 
phase filter analysis in the lower picture 
of Figure 2. 

Another, probably equally effective 


method of obtaining increased defini- 
tion of the central formant frequency 
is illustrated in Figure 3. In the upper 
pattern of this figure is shown the con- 
ventional spectrogram of the digits 
‘one, two, three,’ as made on the type 
of device employed in marking the pic- 
tures of the book Visible Speech (10). 
The lower two patterns show spectro- 
grams which were obtained with simple 
tuned-circuit filters whose sides were 
sharply attenuated by the steeper slopes 
of the regular broad-band filter. In the 
instrument with which the spectro- 
grams of Figure 3 were obtained, there 
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Ficure 4. Analyzing filters employed in constructing the spectrograms of Figure 3. The 
two narrower characteristics are for tuned circuits combined with the wide filter. Due to 
the speed-up factor in the spectrograph, the effective analyzing width is approximately one- 


third that indicated. 


is a speed-up in the analysis by a factor 
of approximately three. The actual 
filter is thus three times the width of 
its effective value. Frequency charac- 
teristics for the three filters employed 
in the analyses of Figure 3 are shown in 
Figure 4. 

The patterns shown in Figures 2 and 
3 are for a male voice with a relatively 
low fundamental frequency. Consid- 
erably more difficulty arises in formant 
display for voices with a higher funda- 
mental voice frequency, as illustrated in 
the upper right-hand portion of Figure 
1. In order to illustrate the effect of 
fundamental frequency when various 
analyzers are employed, simple signals 
were generated with the system dia- 
grammed in Figure 5. The frequency 


‘of the oscillator controlled the fre- 


quency of the pulse generator, and 
the pulse generator in turn excited the 
tuned circuit whose frequency charac- 
teristic is shown in the graph in the 
upper part of the figure. 

The degree of success with which 
the resonant peak of the tuned circuit 
was detected by various types of ana- 
lyzers is indicated in Figure 6. The 
fundamental pulse frequency was 
varied throughout a considerable range 
for each of the spectrograms. In A, a 
resonogram was constructed with the 
same analyzing circuit as that employed 
in making Figure 2. The fundamental 
frequency driving the tuned circuit 
was varied from a low frequency, 
which permitted easy detection of the 
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Figure 5. The tuned circuit whose frequency characteristic is shown in the upper figure 
was excited by the pulse circuit indicated in the lower figure. The repetition rate was 
controlled by the oscillator. 








oe 


Ficure 6. Spectrographic analyses of the output of the tuned circuit of Figure 5, when ex- 
cited by a pulse of increasing repetition rate. Frequency extends vertically and increase in 
pulse repetition rate increases with time from left to right. A is resonogram; B, analysis with 
narrow tuned circujt shown in Figure 4; C, analysis with intermediate tuned circuit shown 
in Figure 4, . 
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peak of the resonance, through to a 
point where the second harmonic of 
the excitation frequency was marked 
by the analyzer. At the right in the 
upper pattern in Figure 6, che correct 
frequency of the resonance is shown by 
the hand-drawn line. In B and C the 
analyzing tuned circuits whose re- 
sponses are shown in Figure 4 were 
employed. In B the frequency of the 
input signal terminated in a value such 
that the second and third harmonics 
are displayed. In C the fundamental 
frequency of the signal terminated in a 
value within the range of the analyzer. 

In such signal analyses, three pri- 
mary factors are involved: the genera- 
or, the signal system which is excited 
by the generator, and the analyzer. Fig- 
ure 6 emphasizes the fact that with 
current filtering techniques the success 
with which resonance peaks in the 
signal system may be detected depends 
very much upon the response charac- 
teristic (or transient time) of the 
analyzing filter relative to the recurrent 
rate at which the system is excited. 

The following parameters are of pri- 
mary interest: F,, the fundamental fre- 
quency of excitation by the generator, 
F,, the nth response (resonance) fre- 
quency of the signal system; AF,, the 
band width in cycles per second of F,, 
at the 3 db points; and AW, the band 
width in cycles per second of the ana- 
lyzing filter at the 3 db points. 

The relationship between AW and 
F, is probably the most important prac- 
tical consideration in the spectro- 
graphic analysis of speech. When AW 
is appreciably less than F,, the output 
of the analyzing filter corresponds to 
the amplitudes ‘of the individual har- 
monics within the signal. In this case 
little energy will appear at the output 


of the analyzer at the frequency of 
F,, unless F, approximates some inte- 
gral multiple of Fo. 

If AW is appreciably greater than 
F,, then the individual harmonics are 
normally no longer indicated, and an 
energy peak approximating F, appears 
at the output of the analyzing filter. 

When F,, is located within a com- 
plicated spectrum, the envelope curve 
on the two sides of the resonance may 
be markedly asymmetrical. Thus when 
AW >> AF, > F,, the frequency at 
which the maximum output of the ana- 
lyzing filter occurs may not correspond 
closely to F,. Thus (when AF, > F.) 
best resonance definition will generally 
be obtained if AW = AF,. 

When AF, >> F,, then the band- 
width or damping constant of the 
resonance is rather clearly defined in 
the amplitude section. An example of 
this is shown at the left in the upper 
pattern of Figure 1, where F, is rela- 
tively low. In this case AF, is well 
depicted. 

In voices with a high fundamental 
frequency, however, it often occurs 
that AF, >> F,. In such cases it is 
obviously impossible to derive accu- 
rately the resonator band widths from 
measurements of the present type of 
narrow-band amplitude sections. This 
is illustrated by the right-hand portion 
of the upper picture of Figure 1 where 
AF, is largely indeterminate. Since this 
portion of the pattern represents essen- 
tially the same settings of the vocal 
cavities, as at the left of the spectro- 
gram, there would appear to be no 
justification for assuming that AF, has 
changed markedly. 

In the analysis of vowels, a special 
difficulty occurs when two resonances 
approach in frequency. As indicated 
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Figure 7. A continuous vowel wave and 
individual fundamental voice frequency pe- 
riods derived from it for separate spectral 
analyses. 


above, if the frequencies of the indi- 
vidual resonances are to be derived 
it is necessary to have AW > F,; and, 
for accurate specification of F), it is 
desirable that AW = AF, >> F,. An 
extreme case occurs when (F,,41 — Fn) 
= F,. If AW < F,, then the individual 
harmonics appear in the analysis; and 
if AW > F,, only one broad region of 
energy concentration may be observ- 
able. Thus, the difficulty is substantial 
whether AW is greater than or less 
than AF,. The problem is particularly 
evident in vowels where Fs approaches 
the value for F;. 

The above described inherent limita- 
tions in present techniques of spectro- 
graphic analyses can be considerably 
reduced by analyzing each fundamental 
voice period separately. With this 
method each period is separated out 
in the time domain and zero input am- 
plitude is maintained beyond its bound- 
aries, as illustrated in Figure 7. A dis- 
cussion of the mathematical basis and 
possible instrumental procedures for 


individual pitch period analysis, how- 
ever, are beyond the scope of the 
present paper. 

The difficulties of formant detection 
are somewhat less severe for actual 
speech than for the analysis of sus- 
tained vowels. In normal speech the 
voice fundamental and the resonances 
are in almost continuous and relatively 
independent movement within the fre- 
quency domain. As resonances and har- 
monics cross, both amplitude and phase 
effects may be of value in observing 
the resonance peaks within the dynamic 
patterns. In general, however, the ac- 
curacy with which shifting resonance 
frequencies can be determined depends 
considerably upon the transient times 
involved. Normally, F, cannot be ac- 
curately defined in appreciably less than 
the build-up time of the resonance; 
this time is approximately 1/AF, (3). 
When both F,, and F, vary, there is a 
physical indeterminancy which depends 
upon their joint rates of movement in 
the frequency domain. The auditory 
perception of such dynamic acoustical 
patterns is very much a matter for 
future investigation. 


Formant Measurements 


Various methods of formant fre- 
quency evaluation have been suggested 
previously. Most of the procedures are 
based upon the measurement of har- 
monic amplitudes obtained with rela- 
tively narrow analyzing filters. 


Amplitude Weighting. One method 
involves a weighting of the individual 
harmonic components according to 
their amplitudes within the formant 
(11). In such a weighting procedure 
the formant value is normally shifted 
toward the low frequency side of the 
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resonance peak. This results from the 
fact that the vocal cavities are of a low- 
pass nature in their frequency char- 
acteristic, so that the lower frequencies 
within the vowel spectrum generally 
have somewhat higher amplitudes. The 
magnitude of the shift generally in- 
creases as the range of frequencies 
considered in the formant evaluation 
is increased. 


Spectrum Transformation. lf formant 
measurement is to be based upon the 
harmonic structure of the formant, it 
might seem desirable to take into ac- 
count the non-linear frequency and 
amplitude characteristics of auditory 
sensation. If psychophysical functions 
for the response of the human observer 
to pure tones are applied to vowel 
spectra, harmonic frequencies and am- 
plitudes are converted to mels and 
sones. Masking effects also may be 
taken into account in such spectrum 
transformation. Certainly at this stage 
of psychophysical knowledge, how- 
ever, caution must be exercised in the 
application of scales based upon ele- 
mentary stimuli, such as pure tones, 
to the evaluation of complex signals. 

When formant peaks are derived 
from such psychophysical vowel plots, 
the formant values tend to be shifted 
above the peak frequencies observed in 
the undistorted spectra. It is primarily 
the lower formants which are shifted, 
since the ear decreases rapidly in sensi- 
tivity in the low frequency region. The 
shifts are thus in the direction opposite 
from that indicated above for direct 
weighting of harmonic amplitudes. 


Strongest Harmonic. A somewhat 
different and relatively simple proce- 
dure is to specify the formant fre- 
quency as the frequency of the har- 


monic of highest amplitude within the 
formant. 

It is obvious that the above methods 
of formant frequency evaluation are 
essentially arbitrary in nature. None 
of them is adequate to deal with the 
highly variable conditions of funda- 
mental frequency and spacing between 
formants which are found in actual 
speech. At one extreme only one har- 
monic may occur in the general region 
of two adjacent formant peaks. 


Response Frequencies. In formant 
frequency evaluation, it is important 
to avoid a limited concept of vowel 
frequency structure in which only the 
narrow-band harmonic components are 
considered. If the vocal mechanism is 
considered to be the fundamental infor- 
mation source in speech, then meas- 
urements of the acoustical signal which 
most directly reflect its properties are 
of primary significance. The most fun- 
damental functional data about such a 
resonant system are the response fre- 
quencies. The frequencies of the peaks 
of the resonances approximate the nat- 
ural response frequencies of the system. 
Scripture (/3) and more recently 
others (2, 6) have emphasized that it 
is these peaks which are of primary 
interest in formant frequency evalua- 
tion. The present author can see no 
justification for weighting calculations 
based on the asymmetry of formants, 
low-pass frequency characteristics of 
the vocal cavities, or mel and loudness 
functions. 

Since the resonance peaks of the 
cavities are basic, it might be thought 
that some type of correction should 
be made for the spectrum of the input 
Jaryngeal pulse and the radiating char- 
acteristics of the lips. There is some 
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question, however, as to whether either 
of these change sufficiently rapidly as 
a function of frequency to have a sig- 
nificant influence upon the judgment of 
formant frequency peaks in sound 
spectrograms. In addition, the input 
laryngeal pulse has a spectrum which 
falls as a function of frequency, and the 
radiating characteristic of the vocal 
cavities is essentially rising in fre- 
quency. Thus these two effects which 
are superimposed upon the natural fre- 
quency response characteristic of the 
vocal cavities tend to cancel. 

With present spectrographic tech- 
niques, then, it appears that the most 
meaningful procedure for formant 
measurement is the direct evaluation 
of the frequency positions of the peaks 
of the vowel envelope curve. Until 
more sophisticated techniques of in- 
strumental analysis are available, there 
does not appear to be any simple pro- 
cedure for the accurate derivation of 
formant frequencies when the formants 
are relatively close together in fre- 
quency and when the fundamental 
voice frequency is relatively high. 


Summary 


The measurement of formant fre- 
quency has been discussed according to 
the concept of the vocal mechanism as 
the basic information source in speech 
communication. The formant frequen- 
cies represent essential acoustical prop- 
erties of the human vocal mechanism. 
Problems in the detection of these fre- 
quencies are reviewed, and procedures 
in the measurement of vowel spectra 
derived by currently available instru- 
mental techniques are discussed. It is 
concluded that measurements of the 
peaks of the vowel envelope curves 


provide the most meaningful data re- 
garding vowel formant frequencies. 
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Unconditioned Stimulus Strength 
and the Galvanic Skin Response 


LEO G. DOERFLER 


JOAN C. KRAMER 


In the type of conditioning which is 
employed currently in galvanic skin 
response audiometry, it is important to 
use an unconditioned stimulus which 
uniformly produces a galvanic skin re- 
sponse. Most of the galvanic skin re- 
sponse testing of auditory acuity at the 
present time employs electric shock as 
the unconditioned stimulus. Studies by 
Doerfler and McClure (/), Goldstein 
and others (4), Grant and others (5), 
Humphreys (6), Meritser and Doerfler 
(7), and Stewart (12), which were con- 
cerned with such variables as reinforce- 
ment of the unconditioned stimulus, 
difficulty of conditioning subjects, and 
validity of galvanic skin response, all 
utilized electric shock for the uncondi- 
tioned stimulus. The extensive use of 
electric shock beth clinically and in 
research would seem to indicate that it 
has been an effective procedure. How- 
ever, in order to effect further refine- 
ment of clinical procedures involving 
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galvanic skin response and electric 
shock, it would seem that research con- 
sideration should be given to those 
factors which might influence ease of 
conditioning and resistance to extinc- 
tion. 

The intensity of the unconditioned 
stimulus as a factor in conditioning and 
resistance to extinction is an area that 
has been subjected to very little experi- 
mental work. Working with air puff as 
the unconditioned stimulus, Passey (8), 
Spence and Taylor (9), Spence (J0), 
and Taylor (13), have all suggested that 
the intensity of the unconditioned stim- 
ulus is related to ease of conditioning 
and resistance to extinction, with the 
stronger unconditioned stimulus giving 
faster conditioning and stronger resist- 
ance to extinction. 

In the study of testing of auditory 
acuity with galvanic skin response, 
using electric shock as the uncondi- 
tioned stimulus, there has been no sys- 
tematic attempt to indicate whether 
there is a difference in the conditioning 
and the resistance to extinction that is 
achieved with various levels of shock. 
Because clinicians in the field of audi- 


. ology are constantly searching for bet- 
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Ficure 1. Circuit diagram for galvanic skin response amplifier. 


ter defined and more systematic clinical 
procedures in galvanic skin response 
audiometry, it would appear to be of 
interest to investigate different levels 
of the unconditioned stimulus with 
pure tone as the conditioned stimulus 
and electric shock as the unconditioned 
stimulus. 

The problem in this study was to 
determine the relationship between the 
strength of the unconditioned stimulus 
(shock) and acquisition of the condi- 
tioned galvanic skin response. A second 
problem was to determine the relation- 
ship between resistance to extinction 
and strength of the unconditioned stim- 
ulus. 


The specific questions asked were: 
(a) Will acquisition of conditioning of 
the galvanic skin response to a pure 
tone differ if the strength of the uncon- 
ditioned stimulus (shock) is varied in 
three ways: minimal, clinical, maximal? 
(b) Will resistance to extinction of the 
conditioned galvanic skin response 
differ if the strength of the uncondi- 
tioned stimulus is varied in three ways: 
minimal, clinical, maximal? 


Procedure 


The subjects were 72 volunteers, 42 
females and 30 males. These subjects 
ranged in age from 16 to 57 years. Re- 
strictions imposed for selection of sub- 
jects were these two requirements: (a) 
response in the ear to be tested to a 
pure tone of 1000 cps at 15 db re 
normal threshold; (b) unfamiliarity 
with the principles of conditioning or 
galvanic skin response techniques. 

The galvanic skin response apparatus 
used in this experiment has been de- 
scribed in previous studies (/, 7, 1/). 
Figure 1 shows a circuit diagram of the 
amplifier. 

The conditioned stimulus was a pure 
tone produced by a commercial audi- 
ometer. The conformance of this audi- 
ometer to Bureau of Standards calibra- 
tion was checked in a Type 9A coupler 
at the Department of Audiology, Uni- 
versity of Pittsburgh. The 1000-cycle 
tone was presented at an intensity level 
of 30 db above normal threshold, and 
was always given at this level when 
called for by the randomized schedule. 
Time intervals between presentations 
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Figure 2. Circuit diagram for shock-producing 
apparatus. 


(in seconds) and shock (S) or no shock 
(NS), in the order given, were as fol- 
lows: 35, S; 40, NS; 30, NS; 40, NS; 
20, S; 25, NS; 20, S; 20, NS; 35, NS; 
20, S. The tone was audible for approx- 
imately 2.5 sec. A 40% randomized 
reinforcement schedule was employed. 

The unconditioned stimulus was a 
pulsatile electric shock. Figure 2 shows 
a circuit diagram of the shock appara- 
tus. The shock was presented for ap- 
proximately .2 sec upon completion of 
the conditioned stimulus, when called 
for by the randomized schedule. Ap- 
proximately .5 sec elapsed after cessa- 
tion of the conditioned stimulus and 
the onset of the unconditioned stimulus. 

The subjects were randomly assigned 
to one of three shock-level groups, and 
testing was carried on until each group 
consisted of 24 subjects. A total of 92 


subjects was originally tested to obtain 
the experimental sample of 72. 

Group A, minimal shock, received 
the least amount of shock necessary to 
produce a detectable skin response. 
This was done by setting the shock 
potentiometer at its lowest level and 
increasing it in dial units of five until 
two acceptable galvanic skin responses 
occurred at the same dial reading. Once 
the minimal shock level was established, 
the dial setting was not altered through- 
out the conditioning schedule. 

Group B, clinical shock, was in- 
structed to indicate to the experimenter 
when the shock was felt. At this point, 
the shock was increased five dial 
units, and the conditioning schedule 
begun. If the subject did not meet the 
conditioning criteria at the end of the 
first conditioning schedule, the shock 
level was increased 10 units and the sec- 
ond schedule begun.* This was the only 
group in which shock intensity could 
be altered during testing. The proce- 
dure was modeled after the standard 
clinical procedure during conventional 
galvanic skin response testing of adults 
at the Department of Audiology. 

Group C, maximal shock, was in- 
structed that the shock would be 
turned on and gradually increased. The 
subjects were requested and urged after 
each 10-unit increase in shock level to 
take more shock, until each subject’s 
maximal level was reached (judged by 


"The criteria employed were identical with 
those described by Stewart (12), Doerfler and 
McClure (1), and Meritser and Doerfler (7). 
They were (a) minimum GSR amplitude of 
5 mm; (b) minimum GSR slope of 45°; (c) 
latency distance of not less than three or more 
than 13 mm as measured between tone onset 
‘pip’ and leading slope of the response. Ester- 
line-Angus chart speed was .5 in. per § sec, 
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TaBLeE 1. Distribution of subjects with respect 
to acquisition of conditioning by one, two, three, 
and four or more schedules. Each of three groups 
(N=24 for each group) received a different level 
of the unconditioned stimulus (shock). 








Number of Schedules 





Group Four or 
One Two Three More 
Minimal Shock 17 2 3 2 
Clinival Shock 17 4 2 1 
Maximal Shock 21 i 0 2 








his request to receive no more shock). 
Once this level was set, it was not 
changed during the test. 

The gain control on the galvanic 
skin response amplifier was set at a lev- 
el judged by the experimenter to be 
adequate to produce readable responses. 
This was not altered during the re- 
mainder of the conditioning and extinc- 
tion procedures. 

A subject was considered condi- 
tioned when he gave three consecutive 
responses to tones without shock, all 
three of which met the criteria of am- 
plitude, latency, and slope. The pro- 
cedure was set up so that at the end 
of the minimum conditioning schedule 
of 10 trials, three tones without shock 
were given. If the subject failed one of 
these responses, the schedule was im- 
mediately begun again. Once the con- 
ditioning criteria of three acceptable 
responses were met, he was considered 
conditioned and the extinction schedule 
started. Time intervals (in seconds) be- 
tween presentations were 35, 20. 20, 
25, 20, 40, 30, 40, 40, 25, 35, 25, 25, 20, 
29-3955) 59, 40; LIC 295 206 9D, 254 205 20s 
The three no-shock trials to determine 
conditioning were incorporated into 
the extinction schedule and considered 
to be the first three extinction trials. 
The extinction schedule was continued 
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until the subject failed to give two con- 
secutive responses meeting the criteria. 
Once this occurred, he was considered 
extinguished and the test discontinued. 

The galvanic skin responses were 
measured with the help of a transpar- 
ency, which enabled the experimenter 
to judge quickly and easily whether a 
response was acceptable. 


Results and Discussion 


In order to determine whether there 
were significant differences among the 
three methods of conditioning in terms 
of varying intensities of the uncondi- 
tioned stimulus, that is, minimal, clin- 
ical, maximal, it was hypothesized that 
there would be no difference between 
cach of the groups in terms of the num- 
ber of subjects conditioned by one 
schedule or more than one schedule. 
Fisher’s (3) direct method of comput- 
ing probability was employed to com- 
pare the minimal with the clinical 
group, the minimal with the maximal 
group, and the clinical with the maximal 
group. There were no statistically sig- 
nificant differences between any of the 
groups in acquisition of conditioning. 
It can be seen from Table 1 that the 
number of subjects conditioned by one 
schedule in each of the groups was 
quite similar and that approximately 
the same number of subjects required 
more than one schedule for condition- 
ing in each of the shock level groups. 

These results would appear to sug- 
gest no relationship between intensity 
of the unconditioned stimulus and ac- 
quisition of conditioning as far as this 
study is concerned. These results dif- 
fer from those reported by Passey (8), 
who indicated a relationship between 
strength of the unconditioned stimulus 
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TasiLe 2. Mean, standard deviation, and range 
of number of trials to extinction for each of three 
groups, each with a different level of the uncon- 
ditioned stimulus (shock). 








Group Mean SD: Range 





Minimal Shock 15.29 12.61 3-40 
Clinical Shock 21.25 16.47 3-60 
Maximal Shock 37.83 32.23 6-115 








and acquisition of conditioning. How- 
ever, if it were possible to have esti- 
mated conditioning in terms of number 
of trials rather than number of sched- 
ules, a similar relationship may have 
_ been revealed. The use of number of 
schedules as the measuring instrument 
was necessary in order to maintain 
approximately a 40% reinforcement 
schedule. It seems possible that differ- 
ences might still exist in terms of acqui- 
sition of conditioning and intensity of 
the unconditioned stimulus but were 
not revealed because of the grossness 
of the measure. 

In order to answer the second ques- 
tion raised in this study concerning the 
effect of the unconditioned stimulus 
on extinction, the mean number of 
trials to extinction and the variability 
of each of the three groups conditioned 


TaBLe 3. Results of the ¢ test (N=24; df=23) for 
evaluation of differences between mean numbers 
of extinction trials. Comparisons are among three 
groups, each with a different level of the uncon- 
ditioned stimulus (shock). 








Groups Difference t P 





Minimal Shock 5.96 1.38 N. 8.* 
Clinical Shock 


Minimal Shock 
Maximal Shock 


Clinical Shock 16.58 2.20 .05 
Maximal Shock 


22.54 3.13 01 








*Not significant. 


under three different strengths of 
shock were determined. An examina- 
tion of these data, as shown in Table 2, 
would appear to show that the maximal 
group on the average resisted extinc- 
tion the greatest, and that the minimal 
group extinguished the earliest. 


It seemed appropriate, after these 
differences in extinction were observed, 
to determine whether significant dif- 
ferences existed among the three 
groups in terms of the mean number of 
trials necessary for extinction. Table 3 
summarizes the results of the analyses 
of the significance of differences, uti- 
lizing the ¢ test. Because there were 
large differences among the standard 
deviations of the three groups, the 
usual ¢ test was not employed. Coch- 
ran’s (2, pp. 168, 169) formula for es- 
timating the value of t was used. Clin- 
ically, the most important result to be 
considered was the comparison be- 
tween clinical and maximal groups. 
The difference of 16.58 between these 
groups was significant at the .05 level. 
It would seem then, that on the average 
the subject remains conditioned the 
longest under conditions of maximal 
shock. However, the great variability 
observed might temper the clinical ap- 
plication of these results. There is every 
indication that some individuals will 
extinguish quite early regardless of 
strength of the unconditioned stimulus. 


The finding that maximal shock 
brings about greater resistance to ex- 
tinction than minimal or clinical shock 
is of extreme importance clinically, 
since resistance to extinction plays a 
vital role in the clinical measurement 
of hearing by galvanic skin response. 
In the course of searching for thresh- 
old, if the subject does not respond to 
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TaBLe 4, Chi-square results for number of 
schedules to condition and number of trials to 
extinction. 











Group x df yh 
Minimal Shock 5.04 1 .05 
Clinical Shock 1.68 1 .20 
Maximal Shock 1.20 : .30 








*Because the theoretical frequencies were small, 
Fisher’s (3) direct method of computing proba- 
bility was also employed: P=.03, .18, and .31 
for Minimal, Clinical, and Maximal Groups, re- 
spectively. 


a presentation of the tone, the clinician 
cannot be certain whether the subject 
has extinguished or whether he is not 
hearing the tone. Chances are increased 
greatly that he has not extinguished 
when maximal shock is used in pref- 
erence to minimal or clinical levels of 
shock. 

In addition to the statistical treatment 
necessary to answer the two major 
questions posed by this study, certain 
comparisons were made to investigate 
any other relationships that might have 
existed. One such comparison involved 
determining whether those subjects 
who conditioned early also extinguished 
more rapidly within each of the three 
groups. It was hypothesized that within 
each group of those subjects extinguish- 
ing in 10 or fewer trials, there would 
be as many requiring one schedule to 
condition as there would be requiring 


TaBLE 5. Mean, standard deviation, and range 
of number of trials from first failure to extinction. 
Comparisons are among three groups, each with 
a different level of the unconditioned stimulus 
(shock). 











Group Mean S.D. Range 
Minimal Shock 4.71 6.35 0-20 
Clinical Shock 6.79 9.15 0-29 
Maximal Shock 14.54 17.80 0-59 
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more than one scheduie to condition. 
The same hypothesis was made con- 
cerning those subjects extinguishing in 
11 or more trials within each group. 
The statistic of chi square was applied. 
The results of this analysis are shown 
in Table 4. Only for the minimal group 
was there any evidence of a relation- 
ship between number of schedules to 
condition and number of trials to ex- 
tinction. 


TaBLE 6. Results of the ¢ test (N=24; df=23) for 
evaluation of differences between mean numbers of 
trials between first failure and extinction. Com 
parisons are among three groups, each with a dif- 
ferent level of the unconditioned stimulus (shock). 











Groups Mean l P 
Difference 
Minimal Shock 2.08 .90 N. S.* 
Clinical Shock 
Minimal Shock 9.83 2.49 05 
Maximal Shock 
Clinical Shock 7.75 1.86 -08 


Maximal Shock 








* Not significant. 


It will be recalled that the criterion 
for extinction was two successive fail- 
ures. It was noted that single failures 
often occurred that were not followed 
by another failure. In an effort to de- 
termine the relationship of first failure 
to extinction as defined in this study, 
an analysis of the three groups was 
made in terms of the number of trials 
between first failure and extinction. 
Table 5 summarizes these results. An 
examination of these data shows that 
the minimal group had the least num- 
ber of acceptable responses between 
first failure and extinction, while the 
maximal group had the largest number. 
Also, the minimal group had the least 
amount of variability around the mean, 
indicating that it was the most reliable 
indication of what to expect in terms of 
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extinction following first failure. The 
general results here indicate again that 
extreme caution is necessary in the 
clinical application of these data as 
far as expectancy of extinction follow- 
ing first failure is concerned. 

In Table 6 is shown the significance 
of differences among the three groups 
in terms of number of trials to extinc- 
tion following first failure. Cochran’s 
(2) formula for estimating the value of 
t was used. The difference between the 
minimal and clinical groups was not 
significant. The difference of 9.83 be- 
tween the minimal group and maximal 
group was significant at the .05 level, 
indicating that the maximal group need- 
ed a significantly greater number of 
trials for extinction following first 
failure than the minimal group. The 
difference between the clinical and 
maximal groups was not significant at 
the .05 level. However, the ¢ had a 
probability value of .08, indicating that 
only eight times in 100 would differ- 
ences of this magnitude or greater arise 
by chance. 

It became evident during the testing 
of the subjects receiving minimal shock 
that many did not acquire the condi- 
tioned response after several condition- 
ing schedules. There were 20 of the 
original 92 subjects who at no time 
gave three consecutive acceptable re- 
sponses to tone alone. Each of these 
subjects was taken through a minimum 
of six schedules (the average was more 
than 70 conditioning trials) before be- 
ing considered unconditionable. Of 
these 20 subjects, 18 were in the min- 
imal shock level group, one was in the 
clinical group, and one was in the max- 
imal group. In order to ascertain wheth- 
er the level of shock was the determin- 
ing factor in the failure of the minimal 


shock group to condition, as many of 
this group as possible, 10 subjects, were 
brought back and retested using max- 
imal shock. The 10 retested subjects 
were conditioned with maximal shock, 
nine by the end of the first schedule, 
that is, 90%, which is comparable to 
the 87.5% of the original maximal 
group conditioned by the first sched- 
ule. (The range of the number of trials 
for subjects of the retested group con- 
ditioned by the end of the first schedule 
was from five to 32; the number for 
the subject who had two schedules 
was 111.) 

It will be recalled that one subject 
in the clinical and one subject in the 
maximal group were considered uncon- 
ditionable, in terms of the criteria of 
this study. Both of these individuals 
revealed a galvanic skin response when 
shock was presented. However, they 
did not meet the criteria when the 
tone was given alone. It has been felt 
by some experimenters in the field of 
galvanic skin response testing that there 
are certain individuals who could be 
termed ‘difficult to condition’ or ‘un- 
conditionable.’ Goldstein and others 
(4) described this situation in testing 
32 normal hearing children with gal- 
vanic skin response as follows: 


Eighteen of the children were easy to 
condition and 14 difficult to condition ac- 
cording to the subjective standards used 
in a previous study. .. . Ease of condition- 
ing seems to be unrelated to the intelli- 
gence quotient as measured by a perform- 
ance test. Ease of conditioning also seems 
unrelated to sex; to age within a range of 
seven years and six months to twelve 
years and eight months; to handedness; to 
initial, terminal, or maximum resistance; 
to random fluctuations of resistance; to 
latency of response; and to overt responses 
and behavior, except for sleepiness. 


It appears that regardless of the level 
of shock employed, there will be some 
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individuals who cannot be satisfactorily 
conditioned for purposes of galvanic 
skin response audiometry. However, in 
this sample the number of such indi- 
viduals who did not condition satisfac- 
torily appeared to be directly related 
to the level of the unconditioned stim- 
ulus. Those subjects who did not con- 
dition were overwhelmingly in the 
minimal shock group. This would tend 
to agree with Taylors (/3) finding 
that adaptation seems to take place 
more rapidly when the unconditioned 
stimulus is mild. 

It is important to remember that 
even though maximal shock appears 
to be the best setting for maintaining 
conditioning, there are certain factors 
which could hamper its use clinically. 
In spite of the fact that the adults used 
in this study were volunteers, many of 
them had strong emotional reactions to 
the shock when it was presented at the 
maximal level. The decision as to 
whether maximal or clinical levels of 
shock should be used on patients in an 
actual clinical situation should take 
many things into account. If the patient 
is undergoing a diagnostic audiological 
evaluation for purposes of rehabilita- 
tion, it is possible that maximal shock 
could impede the establishment of rap- 
port and retard the rehabilitative proc- 
ess. However, in other instances such 
as medico-legal cases, it may be neces- 
sary to employ maximal shock in order 
to insure the most accurate assessment 
of hearing. Still another factor that 
should be considered is the genezal 
health of the patient. Although there 
is no objective evidence either for ac- 
cepting or rejecting this idea, it is pos- 
sible that certain physical or mental 
diseases operant in a patient would con- 


traindicate the use of maximal shock. 
Still another factor might be the gen- 
eral psychological state of the patient 
when he is entering the testing situa- 
tion. With children, it is probably not 
possible to use anything above the clin- 
ical level of shock. The net result of 
this study would seem to indicate that 
there are certain advantages in the use 
of maximal shock, provided it is feas- 
ible to use it. 


Summary 


The galvanic skin response in 72 sub- 
jects with normal hearing was condi- 
tioned to a pure-tone auditory stimulus 
of 1000 cps using electric shock as the 
unconditioned stimulus. Objective cri- 
teria for designation of responses in- 
cluded determination of amplitude, 
latency, and slope. One-third (24 sub- 
jects) of the group received shock at 
a minimum level of intensity; one-third 
received shock at a clinical level of in- 
tensity; one-third received shock at a 
maximal level of intensity. Once con- 
ditioning was established, extinction 
trials were begun. All groups were ex- 
tinguished in the same manner. 

Results indicated that a statistically 
significant difference existed in number 
of responses to extinction among these 
three groups, in the direction of great- 
er resistance to extinction by the max- 
imal group. No statistically significant 
difference was found on acquisition of 
conditioning among the three groups. 
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This bibliography represents a system- 
atic review of published literature in 
the area of delayed auditory feedback. 
Current titles are included through cor- 
respondence of the authors with inves- 
tigators working in the field. Certain 
articles on related subjects, which were 
brought to the attention of the authors 
as a product of this correspondence, 
have been included in a section called 
‘Related Papers’ presented at the end 
of the main bibliography. 

The literature on sensory feedback 
may be said to go back to the nine- 
teenth century when ideas about rela- 
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tionships between sensory and motor 
events achieved a new clarity. Early 
in the century Sir Charles Bell and 
Francois Magendie discovered the sen- 
sory and motor functions of the spinal 
roots; in 1826, Bell wrote about the 
‘circle of nerves’ between the brain and 
muscles. In his Principles of Psychol- 
ogy (1890), William James speculated 
about the importance of sensory infor- 
mation to the initiation of voluntary 
movement and argued that the sensa- 
tions subserve a ‘guiding’ function in 
the control of voluntary movement. 
John Dewey in 1896 argued against the 
unreal dualism of ‘the reflex arc con- 
cept in psychology’ and supported the 
position that stimulus and response are 
the functionally indistinguishable parts 
of a coordinated ‘sensori-motor’ event. 

The idea that coordinated voluntary 
movement depends upon the utilization 
of sensory information by the nervous 
system is now generally accepted. The 
body of knowledge supporting this po- 
sition has developed through research 
on different motor functions among 
which has been that of speech. In 1950 
and 1951, Lee and Black observed a 
remarkable amount of disturbance in 
speech under conditions of delayed 
auditory feedback. This observation 
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served to focus attention on the impor- 
tance of sensory information to the 
normal control of speech. Since that 
time much attention has been given to 
this phenomenon and its applications 
in the field of speech and hearing. As 
a result there has grown a considerable 
literature on which the following is 
based. 

In the compilation of the section on 
auditory feedback, these sources have 
been reviewed as indicated: 


1. Journal of Speech and Hearing Disorders, 
vol. 14 (1), 1949, through vol. 23 (5), 
1958. 

2. Journal of Speech and Hearing Research, 
vol. 1 (1), 1958, through vol. 1 (3), 1958. 

3. Graduate theses in speech and hearing 
research, 1949 through 1956, as reviewed 
in 1 and 2 above. 

4. Journal of the Acoustical Society of 
America, vol. 21 (1), 1949, through vol. 
30 (10), 1958. 

5. AMA Archives of Otolaryngology, vol. 
51 (1), 1950, through vol. 68 (4), 1958. 

6. Speech: The Journal of the College of 
Speech Therapists (London), vol. 14 (1), 
1950, through vol. 21 (2), 1957. 

7. Quarterly cumulative Index Medicus 
(under: hearing, hearing disorders, hear- 
ing tests, speech, speech defects, and 
voice), vol. 51, 1952, through vol. 57, 
1955. 

8. Current List of Medical Literature, Na- 
tional Library of Medicine, U.S. Depart- 
ment of Health, Education, and Welfare 
(under: hearing, hearing disorders, hear- 
ing tests, phonetics, speech, speech dis- 
orders, and voice), vol. 20 (1), 1951, 
through vol. 34 (4), 1958. 

9. Psychological Abstracts (under: audition, 
response processes, language and com- 
munication, and speech disorders), vol. 
23 (1), 1949, through vol. 32 (2), 1958. 

10. Doctoral Dissertations, A. H. Trotier 
and M. Harman, Eds., compiled for As- 
sociation of Research Libraries (New 
York: H. W. Wilson), vol. 17, 1949-50, 
through vol. 22, 1954-55. 

11. Dissertation Abstracts (University Micro- 
films, Inc., Ann Arbor, Mich.) (under 
speech), vol. 16 (1), 1956, through vol. 
18 (6), 1958. 

(Note: The authors would appreciate receiv- 

ing information for future revisions of the 


bibliography. This would include new titles 
to be added as well as errors in the present 
listing.) 


Auditory Feedback 


1. Axprince, R., A comparative investi- 
gation of speech under conditions of 
amplification, noise, and delayed audi- 
tory feedback. Unpub. master’s thesis, 
Univ. Illinois, 1954. 

2. Arxtnson, C. J., Adaptation to delayed 
side-tone 1. U.S. Naval School Aviation 
Medicine, Naval Air Station, Pensacola; 
Ohio State Univ. Res. Found.; Cont. 
N6ONR 22525, Office Naval Res., Proj. 
NR 142-992, U.S. Naval School Aviation 
Medicine, Bur. Medicine Surgery Proj. 
NM 001 064.01.12, Joint Project Report 
No. 12, 1952. 

3. Atkinson, C. J., Adaptation to delayed 
side-tone. J. Speech Hearing Dis., 1953, 
18, 386-391. 

4. Arxrinson, C. J., Four studies on side- 
tone. Univ. Iowa; Signal Corps; Cont. 
DA 36-039 sc-42562, Supplementary re- 
port No. 5, 1954. (For other supple- 
mentary reports, same series, see refer- 
ences 14, 87, this list, and 22, Related 
Papers.) 

5. Arxtnson, C. J., Research study of the 
psycho-acoustic effects of human and 
artificial sidetone. Univ. Iowa; Signal 
Corps; Cont. DA 36-039 sc-42562, Army 
Proj. 3-99-12-022, First Quarterly. Re- 
port, 1952. 

6. Arkinson, C. J., ibid, Second Quarterly 
Report, 1953. 

7. Atkinson, C. J., ibid, Third Quarterly 
Report, 1953. 

8. Atkinson, C. J., ibid, Fourth Quarterly 
Report, 1953. 

9. Atkinson, C. J., ibid, Fifth Quarterly 
Report, 1953. 

10. Atkinson, C: J., ibid, Sixth Quarterly 
Report, 1954. 

11. Arxkinson, C. J., ibid, Seventh Quarterly 
Report, 1954. 

12. Atkinson, C. J., ibid, Final Report, 
1955. (See Scuupert, reference 87, this 
list.) 

13. Atkinson, C. J., Some effects on intel- 
ligibility as the sidetone level and the 
amount of sidetone delay are changed. 
Proc. lowa Acad. Sci., 1954, 61, 334-340. 

14. Atkinson, C. J., Some effects on rate 
and on relative sound pressure level of 
speech produced by altering sidetone 
delay and sidetone level. Univ. Iowa; 
Signal Corps; Cont. DA 36-039 sc-42562, 


oe Se ST oe oe 


Os 


al 


rt, 
his 


el- 
the 
ed. 
40. 
ate 


ne 
va; 
62, 


15. 


16. 


18. 


20. 


Chase, Sutton, 


Army Proj. 3-99-12-022. Supplementary 
Report No. 3, 1954. 

ATKINSON, C. J., Some influences of de- 
layed sidetone upon intelligibility. U.S. 
Naval School Aviation Medicine, Naval 
Air Station, Pensacola; Ohio State Univ. 
Res. Found.; Off. Naval Res., USS. 
Naval School Aviation Medicine, Bur. 
Medicine Surgery Proj. NM 001 064- 
01.13, Joint Project Report, 13, 1952, 
PB 107685. 

Azzi, A., Gli impieghi della voce ritar- 
data in fonetica (The uses of delayed 
voice in phonetics). Rivista di Audio- 
logia Practica, 1952, 2, 75-85. 


. Azzi, A. Il Controllo auditivo della fon- 


azione (The auditory control of phona- 
tion). Rivista di Audiologia Practica, 
1956, 6, 65-88. 

Azz, A., La voce ritardata basi fisio- 
fisiche ed applicazioni pratiche di un 
nuovo dispositivo electtronico per scor- 
pire la sordita simulata (The delayed 
voice; psycho-physical bases and prac- 
tical applications of a new electronic 
technique for detecting simulated deaf- 
ness.) L’Oto-Rino-Laringol. Ital., 1952, 
21, 110-112. 


. Azzi, A., Le prove per svelare la simula- 


zione della sordita (A test for revealing 
simulated deafness). Rivista di Audio- 
logia Practica, 1951, 1, 22-23. 

Azz, A., Un nuovo metodo per svelare 
la simulazione della sordita (A new 
method for revealing the simulation of 
deafness). Archivio Italiano di Otologia 
Rinologia e Laringologia, 1951, 62, 329- 
334. 


21. Azzi, A., and INvernizzi, M., La prova 


della voce ritardata come mezzo diag- 
nostico nelle sordita psicogene (The 
‘delayed voice’ test as diagnostic means 
in cases of psychogenic loss). Rivista 
Oto-Neuro-Oftalmologica, 1951, 26, 1-5. 


22. Baccaro, P. M., A comparative study of 


23. 


24. 


cortical potentials of stutterers and non- 
stutterers under normal reading and de- 
layed speech feedback conditions. Un- 
pub. master’s thesis, Univ. Mississippi, 
1955. 

Barnes, N. S., Delayed feedback: voice 
variable alterations related to intelli- 
gence levels of mentally retarded adults. 
Unpub. master’s thesis, Purdue Univ., 
1957. 

Beaumont, J. T., and Foss, B. M., Indi- 
vidual differences in reacting to delayed 
auditory feedback. Briz. J. Psychol., 
1957, 48, 85-89. 


nN 
WM 


26. 


o 


30. 


SE, 


32. 


ae 


34. 


37. 


First: Feedback Bibliography 195 


. Brack, J. W., Studies of delayed side- 


tone. Pubblicazoni dell’ Universita Cat- 
tolica del Sacro Cuore, Nuova Serie, 
1955, 49, 71-91, Societa Editrice ‘Vita 
E Pensiero, Milano. 

Brack, J. W., Systematic research in ex- 
perimental phonetics: 2. signal recep- 
tion: intelligibility and side-tone. J. 
Speech Hearing Dis., 1954, 19, 140-146. 


. Brack, J. W., The effect of delayed 


side-tone upon vocal rate and intensity. 
J. Speech Hearing Dis., 1951, 16, 56-6C. 


. Brack, J. W., The effect of room char- 


acteristics upon vocal intensity and rate. 
J]. acoust. Soc. Amer., 1950, 22, 174-176. 


. Brack, J. W., The persistence of the 


effects of delayed side-tone. J. Speech 
Hearing Dis., 1955, 20, 65-68. 
Brupaker, R. S., An experimental in- 
vestigation of speech disturbance as a 
function of the intensity of delayed 
auditory feedback. Unpub. doctor’s dis- 
sertation, Univ. Illinois, 1952. 
Bruncarp, J., An experimental investi- 
gation into the use of PGSR as an ad- 
junct to delayed sidetone. Unpub. mas- 
ter’s thesis, Univ. Washinzton, 1956. 
Butter, R. A., and Gatroway, F. T., 
Factoral analysis of the delayed speech 
feedback phenomenon. J. acoust. Soc. 
Amer., 1957, 29, 632-635. 

Cuase, R. A., Effect of delayed audi-ory 
feedback on the repetition of p:ech 
sounds. J. Speech Hearing Dis., 1958, 23, 
583-590. 

Cuase, R. A., Sutton, S., and First, 
Darune, A developmental study of 
changes in behavior under delayed sud- 
itory feedback. Communications Lab., 
Dept., Otolarngology, Columbia Univ., 
Coll. Physicians Surgeons, Research Re- 
port, 1957. 


. Davinson, G. D., The effect of altered 


external side-tone transmission time 
upon oral reading rate, precision of 
articulation and pitch variability. Unpub. 
doctor’s dissertation, Ohio State Univ., 
1955. 


. Dorurinc, D. G., Changes in psycho- 


physiological responses produced by de- 
layed speech feedback, U.S. Naval 
School Aviation Medicine, Naval Air 
Station, Pensacola; Central Institute for 
Deaf, St. Louis; Cont. 1151(02), Proj. 
(Nr 146-092), U.S. Naval School Avia- 
tion Medicine, Bur. Medicine Surgery, 
Proj. NM 001 102 502, Report No. 1, 
1956. 

Doenrinc, D. G., and Harsoxp, G. J., 








196 Journal of Speech and Hearing Research 


38. 


39. 


40. 


41. 


43. 


45. 


46. 


47. 


48. 


The relation between speech disturb- 
ance and psychophysiological changes 
resulting from delayed speech feedback. 
U.S. Naval School Aviation Medicine, 
Naval Air Station, Pensacola; Central 
Institute for Deaf, St. Louis; Cont. 1151 
(02), Proj. (Nr 146-092) U.S. Naval 
School Aviation Medicine, Bur. Medi- 
cine Surgery Proj. NM 13 01 99, Subtask 
1, Report No. 5, 1957. 

Exxins, E., Effects of side-tone delay on 
oral reading responses under conditions 
of binaural and monaural stimulus pres- 
entation. Unpub. master’s thesis, Univ. 
Maryland, 1956. 

Exurot, C. M., Responses of hard of 
hearing subjects to delayed sidetone. 
Unpub. master’s thesis, Vanderbilt Univ. 
School Medicine, 1955. 

Ewertsen, H. W., Delayed speech test. 
Acta Oto-Laryngologica, 1955, 45, 383- 
387. 

Farrpanks, G., Selective vocal effects of 
delayed auditory feedback. J. Speech 
Hearing Dis., 1955, 20, 333-346. 


. Farrsanxs, G., Systematic research in 


experimental phonetics: 1. A theory of 
the speech mechanism as a servosystem. 
J. Speech Hearing Dis., 1954, 19, 133-139. 
Farrpanks, G., and Gutrman, N., Ef- 
fects of delayed auditory feedback upon 
articulation. J. Speech Hearing Res., 
1958, 1 (1), 12-22. 


. Farrsanks, G., and Jarcer, R., A device 
for continuously variable time delay of * 


headset monitoring during magnetic re- 
cording of speech. J. Speech Hearing 
Dis., 1951, 16, 162-164. 

Feverman, H., An experimental study 
of anticipated difficulty of oral reading 
as a function of the amount of exposure 
to delayed auditory feedback. Unpub. 
master’s thesis, Penn. State Univ., 1957. 
Fors, E. W., An experimental investiga- 
tion of the time delay interval which 
produces maximum speech disturbance 
during delayed auditory feedback. Un- 
pub. master’s thesis, Univ. Illinois, 1954. 
Fry, D. B., The experimental study of 
speech. In A. J. Ayers and others (Eds.), 
Studies in Communication: contributed 
to Communication Res. Center, Univ. 
College, London, 154-155. London: Mar- 
tin Secker and Warburg, 1955. 
Grssons, E. W., and Wincuester, R. A., 
A delayed sidetone test for detecting 
uniaural functional deafness. Arch. Oto- 
laryngol., 1957, 66, 70-78. 


49. 


50. 


51. 


Dz: 


53. 


54. 


55. 


56. 


57. 


58. 


59, 


60. 


Gotprars, W., and Braunstein, P., Re- 
actions to delayed auditory feedback in 
schizophrenic children. In P. H. Hoch 
and J. Zubin (Eds.), Psychopathology of 
Communication, 49-63. New York: 
Grune and Stratton, 1958. 

Grover, J. M., A study comparing a- 
daptation to stuttering with adaptation 
to auditory delayed speech feedback. 
Unpub. master’s thesis, Univ. Wichita, 
1956, 

Gutrman, N., An analysis of articula- 
tion as a function of delayed auditory 
feedback. Unpub. master’s thesis, Univ. 
Illinois, 1951. 

Gutman, N., Experimental studies of 
the speech control system. Unpub. doc- 
tor’s dissertation, Univ. Illinois, 1954. 
Ham, R. E., Certain effects on speech 
of alterations in the auditory feedback 
of speech defectives and normals. 
Unpub. doctor’s dissertation, Purdue 
Univ., 1956. 

Hantey, C. N., and Tirrany, W. R., 
Auditory malingering and psychogenic 
deafness. Arch. Otolaryngol., 1954, 60, 
197-201. 2 
Han ey, C. N., and Tirrany, W. R., An 
investigation into the use of electro- 
mechanically delayed side-tone in aud- 
itory testing. J. Speech Hearing Dis., 
1954, 19, 367-374. 

Hantey, C. N., Tirrany, W. R., and 
Brunearp, J. M., Skin resistance changes 
accompanying the sidetone test for aud- 
itory malingering. J. Speech Hearing 
Res., 1958, 1, 286-293. 

Harsotp, G. J., and Dorurine, D. G., 
A rating scale measure of speech dis- 
turbances that accompany delayed 
speech feedback. U. S. Naval School 
Aviation Medicine, Naval Air Station, 
Pensacola; Ohio State Univ. Res. Found.; 
Cont. N6ONR 22525, Off. Naval Res. 
Proj. NR 145-993, U.S. Naval School 
Aviation Medicine, Bur. Medicine Sur- 
gery Proj. NM 180299, Subtask 1, Re- 
port No. 71, 1957. 

Harrorp, E. R., The effect of loudness 
recruitment on delayed speech feed- 
back. Unpub. doctor’s dissertation, 
Northwestern Univ., 1958. 

Hersst, I., An experimental study of 
the effect of room characteristics and 
noise upon the rate and accuracy of 
reading familiar material. Unpub. mas- 
ter’s thesis, Ohio State Univ., 1951. 
Hut, F. M., An experimental investi- 
gation of speech disturbance as a func- 


- = t 


f 


ic 
0, 


O- 
d- 


Sc 


eS 


61. 


63. 


65. 


66. 


67. 


68. 


69. 


70. 


71. 


72. 


Chase, Sutton, 


tion of the frequency distortion of de- 
layed auditory feedback. Unpub. doc- 
tor’s dissertation, Univ. Illinois, 1952. 
Huntincton, D. A., An experimental 
investigation of the relationships be- 
tween auditory abilities and the speech 
disturbances produced by delayed aud- 
itory feedback. Unpub. doctor’s disserta- 
tion, Univ. Illinois, 1952. 


. Jorpan, E. P., A comparative investiga- 


tion of impromptu speaking and oral 
reading under conditions of delayed 
auditory feedback. Unpub. master’s 
thesis, Univ. Illinois, 1952. 

Katmus, H., Dengs, P., and Fry, D. B., 
Effect of delayed acoustic feed-back on 
some non-vocal activities. Nature, Lon- 
don, 1955, 175, 1078. 


. Kune, M. V., Guze, H., and Haccerrty, 


A. D., An experimental study of the 
nature of hypnotic deafness: effects of 
delayed speech feedback. J. clin. exp. 
Hypnosis, 1954, 2, 145-156. 

K6nic, G., Sprachverzégerungstest zum 
nachweis der simulation (Delayed 
speech test as a proof of malingering). 
Z. Laryngol. Rhinol. Otol., 1955, 34, 
748-751. 

Lee, B. S., Artificial stutter. J. Speech 
Hearing Dis., 1951, 16, 53-55. 

Lee, B. S., Effects of delayed speech 
feedback. J. acoust. Soc. Amer., 1950, 
22, 824-826. 

Lez, B. S., Some effects of side-tone 
delay. J. acoust. Soc. Amer., 1950, 22, 
639-640. 

LeirH, W. R., and Pronxo, N. H., 
Speech under stress: a study of its 
disintegration. Speech Monogr., 1957, 24, 
285-291. 

Lercue, E., and Nesset, E., Neue beo- 
bachtungen bei reihenuntersuchungen 
mit verzdgerter sprachriickkopplung 
(Lee-effekt) [Further observations in 
serial studies with delayed speech feed- 
back (Lee effect)]. Archiv ftir Ohren- 
Nasen-und Kehlkopf-Heilkunde, 1956, 
169, 505-508. 

Marpte, N. B., and Morritt, S. N., A 
device for the production of delayed 
side-tone. U.S. Naval School Aviation 
Medicine, Naval Air Station, Pensacola, 
Bur. Medicine Surgery, Res. Proj. NM 
001 064.01.08, Joint Project Report No. 
8, 1951. 

McCroskey, Jr., R. L., Some effects of 
anaesthetizing the articulators under con- 
ditions of normal and delayed side-tone. 
U.S. Naval School Aviation Medicine, 


rie 


74. 


75. 


76. 


the 


78. 


79. 


80. 


81. 


82. 


First: Feedback Bibliography 197 


Naval Air Station, Pensacola; Ohio 
State Univ. Res. Found.; Cont. N6ONR 
22525, Off. Naval Res., Proj. NR 145- 
993, U.S. Naval School Aviation Medi- 
cine Bur. Medicine Surgery, Proj. NM 
001 104 500, Joint Project Report No. 
65, 1956. 

Metrose, J., An experimental study of 
speech under conditions of delayed aud- 
itory feedback between one and five 
seconds. Unpub. master’s thesis, Univ. 
Illinois, 1953. 

Metrose, J.. The temporal course of 
changes in the amount of vocal dis- 
turbance produced by delayed auditory 
feedback. Unpub. doctor’s dissertation, 
Univ. Illinois, 1954. 

Mever-Eppter, W., Verzégerte riick- 
kopplung als mittel der stimmforschung 
(Delayed feedback as a means of speech 
investigation). Elektr. Rdsch., 1956, 10, 
91-93, 

Meyer-Eppter, W., and Lucusincer, R., 
Beobachtungen bei der verzdgerten 
riickkopplung der sprache (Lee-effekt) 
{Observations on the reactions to de- 
layed auditory feedback (Lee effect)]. 
Folia Phoniatrica, 1955, 7, 87-99. 
Neety, K. K., The effect of oral prac- 
tice in presence of different conditions 
of side-tone upon the rate and sound 
pressure level of the speech of a group 
of stutterers. Unpub. doctor’s disserta- 
tion, Ohio State Univ., 1951. 

Niwetz, M., An experimental study of 
speech under conditions of delayed aud- 
itory feedback: time delay between .01 
and .10 second. Unpub. master’s thesis, 
Univ. Illinois, 1953. 

Osurine, M., An analysis of relationships 
between measurement of reading skills, 
speech performances, speech attitudes 
and measurements of oral reading rate 
under conditions of delayed side-tone. 
Unpub. master’s thesis, Univ. Maryland, 
1956. 

Peters, R. W., The effect of changes 
in external side-tone delay and level 
upon rate of oral reading of normal 
speakers. Unpub. doctor’s dissertation, 
Ohio State Univ., 1953. 

Peters, R. W., The effect of changes 
in side-tone delay and level upon rate 
of oral reading of normal speakers. J. 
Speech Hearing Dis., 1954, 19, 483-490. 
Pronko, N. H., and Leirn, W. R., Be- 
havior under stress: a study of its dis- 
integration. Psychol. Rept., 1956, 2, 
205-222. 





198 


83. 


84. 


85. 


86. 


87. 


88. 


89. 


90. 


91. 


93. 


94. 





Journal of Speech and Hearing Research 


Quictey, S. P., The vocal effects of de- 
layed auditory feedback and their re- 
lationships to other individual character- 
istics. Unpub. doctor’s dissertation, Univ. 
Illinois, 1957. 

RABENSTEIN, Jr., J. E., Relationship be- 
tween responses to delayed auditory 
feedback and embedded-figures test. Un- 
pub. master’s thesis, Pennsylvania State 
Univ., 1958. 

Rawnsiey, A. I, and Harris, J. D., 
Comparative analysis of normal speech 
and speech with delayed side-tone by 
means of sound spectrograms. U. S. 
Naval Submarine Base, New London, 
Bur. Medicine Surgery, Navy Dept. 
Proj. NM 003 041.56.03, Medical Re- 
search Laboratory Report No. 248, 13 
(a), 1954. 

Rosinson, F. B., Effects of changes in 
the relationship between the speech and 
the external side-tone level on the oral 
reading rate of stutterers and non- 
stutterers. Unpub. doctor’s dissertation, 
Ohio State Univ., 1951. 

Scuusert, E. D., Research study of the 
psycho-acoustic effects of human and 
artificial sidetone. Univ. Iowa; Army; 
Proj. 3-99-12-022, Cont. DA36-039, sc- 
42562, Final Report, 1954, (Atkinson 
series). 

Scuwartz, R. J., Vocal responses to de- 
layed auditory feedback in congenitally 
blind adults. Unpub. doctor’s disserta- 
tion, Purdue Univ., 1958. 

SIEGENTHALER, B. M., and Bruspaker, R. 
S., Suggested research in delayed audi- 
tory feedback. Pennsylvania Speech An- 
nual, 1957, 14, 24-31. 

Sits, S. A.,:Some effects of delayed 
side-tone on the speech variables of 
children. Unpub. master’s thesis, Purdue 
Univ., 1957. 

Sprka, B., A study of relationships ex- 
isting between certain aspects of per- 
sonality and some vocal effects of de- 
layed speech feedback. Unpub. doctor’s 
dissertation, Purdue Univ., 1952. 


. Spitka, B., Relationships between certain 


aspects of personality and some vocal 
effects of delayed speech feedback. J. 
Speech Hearing Dis., 1954, 19, 491-503. 
SpitkA, B., Some vocal effects of differ- 
ent reading passages and time delays in 
speech feedback. J. Speech Hearing Dis., 
1954, 19, 37-47. 

Spitka, B., Hantey, T. D., and Steer, 
M. D., Relationships between certain 
aspects of personality and some vocal 


Os: 


96. 


97. 


98. 


99. 


101. 


effects of delayed speech feedback. 
Navy, Off. Naval Res., Spec. Devices 
Ctr., Specdevcen 104-2-37,; Purdue Univ. 
Specdevcen Proj. 20-F-8, Cont. No6ori- 
104, Technical Report, 1954. 
SPuEHLER, H. E., Effects and interactions 
of delayed side-tone and auditory flut- 
ter. Unpub. doctor’s dissertation, Pur- 
due Univ., 1956. 

Tenacuia, G., Contribrito ad un nuovo 
metado per svelare la sordita a mezzo 
della voce ritardata (New method of 
discovering deafness by means of de- 
layed speech). Giornale di Medicina 
Militaire, 1954, 104, 299-301. 

Tirrany, W. R., and Hantey, C. N., 
Adaptation to delayed side-tone. J. 
Speech Hearing Dis., 1956, 21, 164-172. 
Tirrany, W. R., and Han ey, C. N., 
Delayed speech feedback as a test for 
auditory malingering. Science, 1952, 115, 
59-60. 

Tirrany, W. R., Hantey, C. N., and 
SuTHERLAND, L. C., A simple mechanical 
adapter for variable side-tone delay. J. 
Speech Hearing Dis., 1954, 19, 504-506. 


. Waite, R., The relationship between 


oral reading .rate and the interval of de- 
layed auditory feedback causing maxi- 
mum disturbance. Unpub. master’s the- 
sis, Penn. State Univ., 1958. 
Wincuester, R. A., and Gissons, E. W., 
Relative effectiveness of three modes of 
delayed sidetone presentation. Arch. 
Otolaryngol., 1957, 65, 275-279. 


Related Papers 


I. 


Anperson, B. A., and Knaur, V. H., Two 
ways of increasing the clinical usefulness 
of a magnetic tape recorder. J. Speech 
Hearing Dis., 1949, 14, 37-42. 


. Atkinson, C. J., A study of vocal re- 


sponses during controlled aural stimula- 
tion. Unpub. doctor’s dissertation, Ohio 
State Univ., 1950. 


. Arkrnson, C. J., Vocal responses during 


controlled aural stimulation. J. Speech 
Hearing Dis., 1952, 17, 419-426. 


. Béxésy, G. v., The structure of the mid- 


dle ear and the hearing of one’s own 
voice by bone conduction. J. acoust. Soc. 
Amer., 1949, 21, 217-232. 


. Béxésy, G. v., Vibration of the head in 


a sound field and its role in hearing by 
bone conduction. J. acoust. Soc. Amer., 
1948, 20, 749-760. 


. Brack, J. W., A compensatory effect in 


vocal responses to stimuli of low intensi- 
ty. J. exp. Psychol., 1950, 40, 396-397. 





10. 


16. 


Chase, Sutton, First: Feedback Bibliography 199 


. Brack, J. W., Application of side-tone 


in subjective tests of microphones and 
headsets. U.S. Naval School Aviation 
Medicine, Naval Air Station, Pensacola; 
Ohio State. Univ. Res. Found.; Cont. 
N6ONR_ 22525, Off. Naval Res. Proj. 
Designation NR _ 145-993, U.S. Naval 
School Aviation Medicine, Bur. Medicine 
Surgery, Proj. NM 001 064.01.20, Joint 
Report No. 20, 1954, PB 114247. 


. Brack, J. W., Control of the sound pres- 


sure level of voice. U. S. Naval School 
Aviation Medicine, Naval Air Station, 
Pensacola; Ohio State Univ. Res. Found., 
Cont. N6ONR 22525, Off. Naval Res., 
Proj. Designation NR 145-993, U.S. Naval 
School Aviation Medicine, Bur. Medicine 
Surgery, Proj. NM 001 104 500.42, Joint 
Report No. 42, 1955, PB 117653. 


. Brack, J. W., Some effects of auditory 


stimuli upon voice. J. Aviat. Med., 1950, 
21, 251-255 and 277. 

Brack, J. W., Some effects upon voice of 
hearing tones of varying intensity and 
frequency while reading. Speech Monogr., 
1950, 17, 95-98. 


. Brack, J. W., The effect of noise-induced 


temporary deafness upon vocal intensity. 
Speech Monogr., 1951, 18, 74-77. 


. Brack, J. W., The loudness of side-tone. 


Speech Monogr., 1954, 21, 301-305. 


. Brack, J. W., The relation between mes- 


sage-type and vocal rate and intensity. 
Speech Monogr., 1949, 16, 217-220. 


. Brack, J. W., and Morritt, S. N., Pitch 


of side-tone. U.S. Naval School Aviation 
Medicine, Naval Air Station, Pensacola; 
Ohio State Univ. Res. Found.; Cont. 
N6ONR 22523, Off. Naval Res. Proj. 
Designation NR_ 145-993, U.S. Naval 
School Aviation Medicine, Bur. Medicine 
Surgery Proj. NM 001 064.01.31, Joint 
Project Report No. 31, 1954, PB 116530. 


. Brack, J. W., and Toruurst, G. C., In- 


telligibility as related to the path of air- 
borne side-tone. U.S. Naval School 
Aviation Medicine, Naval Air Station, 
Pensacola; Ohio State Univ. Res. Found.; 
Cont. N6ONR 22525, Office Naval Res. 
Proj. Designation NR 145-993, U.S. Naval 
School Aviation Medicine, Bur. Medicine 
Surgery Proj. NM 001 064.01.34, Joint 
Project Report No. 34, 1954. 

Buiss, E. L., Micron, C. J., Harpin 
Brancu, C. H., and Samuets, L. T., Re- 
action of the adrenal cortex to emotional 
stress. Psychosomatic Med., 1956, 18, 56- 
76 (p. 68). 


K7: 


26. 


nm 
MN 


Broapsent, D. E., Speaking and listening 
simultaneously. Med. Res. Council, Appl. 
Psychol. Res. Unit, Psychol. Lab., Cam- 
bridge, Eng., Report No. 146/151, 1951. 


. Broapsent, D. E., Speaking and listening 


simultaneously. J. exp. Psychol., 1952, 43, 
267-273. 


. Cuerry, C., and Sayers, B. McA., E£x- 


periments upon the total inhibition of 
stammering by external control, and some 
clinical results. J. Psychosomatic Res., 
1956, 1, 233-246. 


. Cutter, H. M., The effects of stress on 


the perception of time by normal and 
neurotic subjects. Unpub. doctor’s dis- 
sertation, Purdue Univ., 1951. 


. Drrrman, H. -H., The role of proprio- 


ceptive sensibilities in speech production. 
Unpub. doctor’s dissertation, Univ. Den- 
ver, 1955. 


. Dotcu, J. P., and Scuusert, E. D., Study 


of body-conducted sidetone. Univ. Iowa; 
Signal Corps; Cont. DA 36-039 sc-42562, 
Supplementary Report No. 6, 1954. 


. Dragcert, G. L., Relationships between 


voice variables and speech intelligibility 
in high level noise. Speech Monogr., 1951, 
18, 272-278. 


. Fairsanks, G., Systematic research in ex- 


perimental phonetics: 1. A theory of the 
speech mechanism as a servosystem. J. 
Speech Hearing Dis., 1954, 19, 133-139. 


. Frercuer, H., Rarr, G. M., and ParMLey, 


F., Study of the effect of different a- 
mounts of side-tone in the telephone set. 
Western Electric Co., Report No. 19412, 
Case No. 120622, 1918. 

FunkenstEIn, D. H., Kine, S. H., and 
DrotetrE, Marearer E., Mastery of 
Stress. Cambridge, Mass.: Harvard Univ. 
Press, 1957. 


. Grspons, E. W., Wincuester, R. A., and 


Kress, D. F., The variability of oral read- 
ing rate. J. Speech Hearing Dis., 1958, 
23, 591-593. 


. Gress, C. B., The continuous regulation 


of skilled response by kinaesthetic feed 
back. Brit. J. Psychol., 1954, 45, 24-39. 


. Granit, R., Systems for control of move- 


ment. Premier Congrés International des 
Sciences Neurologiques Bruxelles, 21-28 
Juillet 1957. Extrait du volume publié a 
loccasion de la premiére journée com- 
mune, 63-99, 


. Hammonpn, P. H., Merton, P. A., and 


Sutton, G. G., Nervous gradation of 
muscular contraction. Brit. Med. Bull., 
1956, 12, 214-218. 





31. Haney, T. D., and Srrer, M. D., Effect 


of level of distracting noise upon speak- 
ing rate, duration and intensity. J. Speech 
Hearing Dis., 1949, 14, 363-368. 

. Herrerwrne, R. F., The role of proprio- 
ception in the control of behavior. Trans. 
N.Y. Acad. Sciences, 1958, 2°, 739-764. 

. Korosow, N., Reactions to stress: a re- 
flection of personality trait organization. 
J. abnorm. soc. Psychol., 1955, 51, 464-468. 
. Korosow, N., The relationship between 
personality and reactions to audiogenic 
stress. Unpub. doctor’s dissertation, New 
York Univ., 1953. 

. Ligutroot, C., and Morritt, S. N., Loud- 
ness of speaking: the effect of the in- 
tensity of side-tone upon the intensity of 
the speaker. U.S. Naval School Aviation 
Medicine Res., Naval Air Station, Pensa- 
cola; Kenyon College; Cont. N7 onr-411 
T. O. L, Off. Naval Res., Proj. Designa- 
tion NR 782 004, U.S. Naval School 
Aviation Medicine Res., Bur. Medicine 
Surgery Proj. NM 001 053, Joint Project 
Report No. 4, 1949, PB 115196. 

. Matroy, M. M., Investigation of the re- 
lationship between the pitch of side-tone 
and the frequency of the voice. Unpub. 
master’s thesis, Ohio State Univ., 1953. 

. Peters, R. W., The effect of filtering of 
side-tone upon speaker intelligibility. J. 
Speech Hearing Dis., 1955, 20, 371-375. 

. Peters, R. W., The effect of high-pass 
and low-pass filtering of side-tone upon 
speaker intelligibility. U.S. Naval School 
Aviation Medicine, Naval Air Station, 
Pensacola; Ohio State Univ. Res. Found.; 
Cont. N6ONR 22525, Off. Naval Res. 
Proj. Designation NR 145-993, U.S. Naval 
School Aviation Medicine, Bur. Medicine 


200 Journal of Speech and Hearing Research 


Surgery Proj. NM 001 064.01.25, Joint 
Project Report No. 25, 1954. 

. Pererson, G. E., An oral communication 
model. Language, 1955, 31, 414-427. 

. Poutton, E. C., Simultaneous and alter- 
nate listening and speaking. J. acoust. Soc. 
Amer., 1955, 27, 1204-1207. 

. SmitH, W. M., Smiru, K. U., StTantey, 
R., and Hartey, W., Analysis of per- 
formance in televised visual fields: pre- 
liminary report. Percep. Motor Skills, 
1956, 6, 195-198. 

. Stromsta, C. P., A first approximation of 
the distance from vocal cords to cochlea 
and the transit time of bone-conducted 
sound from the region of the vocal cords 
to the region of the cochlea. Unpub. 
master’s thesis, Ohio State Univ., 1951. 

. VerzEANO, M., Time-patterns of speech 
in normal subjects. J. Speech Hearing 
Dis., 1950, 15, 197-201. 

. Verzeano, M., Time-patterns of speech 
in normal subjects. II. J. Speech Hearing 
Dis., 1951, 16, 346-350. 

. Wincuester, R. A., and Gissons, FE. W., 
The effect of auditory masking upon 
oral reading rate. J. Speech Hearing Dis., 
1958, 23, 250-252. 

. Woon, K. S., A preliminary study of 
speech deterioration under complete 
binaural masking. West. Speech, 1950, 14, 
38-40. 


Acknowledgment 

The authors gratefully acknowledge 
the generous support and assistance of 
Dr. Edmund P. Fowler, Jr., Dr. Joseph 
Zubin, Dr. Newman Guttman, Miss 
Gina Aversa, and Mr. Lloyd Gilden. 





